HEPATITIS C VIRUS HELICASE CRYSTALS AND MOLECULES
COMPRISING HELICASE BINDING POCKETS 

TECHNICAL FIELD OF THE INVENTION 

The invention relates to the X-ray crystal 
structure of the hepatitis C virus helicase domain. More 
specifically, the invention relates to crystallized 
complexes of HCV helicase and an oligonucleotide, to 
crystallizable compositions of HCV helicase and an 
oligonucleotide and to methods of crystallizing an HCV 
helicase-oligonucleotide complex. The invention further 
relates to a computer programmed with the structure 
coordinates of the HCV helicase oligonucleotide binding
pocket or the HCV helicase nucleotide triphosphate pocket 
wherein said computer is capable of displaying a three- 
dimensional representation of that binding pocket. 

BACKGROUND OF THE INVENTION 

Infection by the hepatitis C virus (HCV) is 
responsible for most transfusion-associated cases of non- 
A, non-B hepatitis and also accounts for a significant 
proportion of community-acquired hepatitis cases 
worldwide. Relatively few HCV infected individuals
experience acute hepatitis, but up to 85% appear to
develop persistent infection that often leads to chronic 
hepatitis and liver cirrhosis, eventually predisposing 
them to hepatocellular carcinoma. At present, vaccines 
are unavailable and no broadly effective therapies exist 
for this viral disease. Consequently, much research has 
focused on the HCV replicative enzymes as targets for 
more effective therapies. 

HCV contains an approximately 9.6 kb single- 
stranded positive sense RNA genome classified as its own 
genus in the Flaviviridae family of animal viruses, which 
also includes the flavivirus and pestivirus genera. Its 
genome consists of a conserved 5' nontranslated sequence 
that serves as an internal ribosomal entry site, a single
open reading frame that encodes a polyprotein of >3000 
amino acids, and a 3' nontranslated region. The 3' 
nontranslated region contains tracts of poly(U)n and 
poly(UC)n followed by a novel conserved 98 nucleotide 
sequence.

Proteolytic processing of the HCV polyprotein 
by virally-encoded proteases generates several 
nonstructural (NS) proteins with enzymatic activities 
essential for the replicative cycle of the virus [P.
Neddermann et al., Biol. Chem., 378, pp. 469-476 (1997)].
NS2 encodes a presumed metalloprotease, NS5B is an RNA-
dependent RNA polymerase, and NS3 is a bifunctional
enzyme with a serine protease localized to the N-terminal
181 residues of the protein and an RNA helicase in the C-
terminal 465 amino acids. The NS3 protease performs an
intramolecular cleavage at the NS3/NS4A junction to form 
a tight noncovalent NS3-NS4A complex necessary for 
efficient processing of the remaining polyprotein [C. 
Failla et al., J. Virol., 69, pp. 1769-1777 (1995); R.
Bartenschlager et al., J. Virol., 69, pp. 7519-7528
(1995); Y. Tanji et al., J. Virol., 69, pp. 1575-1581
(1995)]. To date, no evidence exists to suggest that the
serine protease and helicase domains are separated by 
proteolytic processing of NS3 in vivo. This may reflect 
economical packaging of these enzymatic components, or 
could imply a functional interdependence between the two 
domains.

Numerous studies have demonstrated that the 
serine protease [J. L. Kim et al., Cell, 87, pp. 343-355
(1996); W. Markland et al., J. Gen. Virol., 78, pp. 39-
43 (1997); C. Steinkuhler et al., J. Virol., 70, pp.
6694-6700 (1996)] and RNA helicase domains [J. A. Suzich
et al., J. Virol., 67, pp. 6152-6158 (1993); C. L. Tai et
al., J. Virol., 70, pp. 8477-8484 (1996); L. Jin et al.,
Arch. Biochem. Biophys., 323, pp. 47-53 (1995); and F.
Preugschat et al., J. Biol. Chem., 271, pp. 24449-24457
(1996)] of NS3 can be expressed independently and
isolated as catalytically active species. However, 
emerging evidence suggests that the NS3 protease and 
helicase domains may contact one another and modulate NS3 
catalytic activities. Examples include apparent 
differences in pH optima of ATPase and RNA unwinding 
activities between a contiguous NS3 protein complexed 
with the NS4A cofactor [K. A. Morgenstern et al., J.
Virol., 71, pp. 3767-3775 (1997); Z. Hong et al., J.
Virol., 70, pp. 4261-4268 (1996)] and an isolated NS3
helicase domain [C. L. Tai et al., J. Virol., (1996),
supra; L. Jin et al., Arch. Biochem. Biophys., (1995),
supra; F. Preugschat et al., J. Biol. Chem., (1996),
supra; and Y. Gwack et al., Biochem. Biophys. Res.
Commun., 225, pp. 654-659 (1996)]. Similarly, the ATPase
activities of both proteins differ in their sensitivity
to polynucleotide stimulation. Contiguous NS3 appears to
have a lower apparent dissociation constant for poly(U)
than does the helicase domain [J. A. Suzich et al., J.
Virol., (1993), supra; F. Preugschat et al., J. Biol.
Chem., (1996), supra; K. A. Morgenstern et al., J.
Virol., (1997), supra; A. Kanai et al., FEBS Lett., 376,
pp. 221-224 (1995)]. Aside from these differences, both
proteins display nearly indistinguishable kinetic
parameters for NTP hydrolysis when stimulated with
saturating polynucleotide [J. A. Suzich et al., J.
Virol., (1993), supra; K. A. Morgenstern et al., J.
Virol., (1997), supra], both display 3'-5'
directionality for translocation along a polynucleotide
substrate, and the helicases of both proteins effectively
unwind duplex RNA:RNA substrates [C. L. Tai et al., J.
Virol., (1996), supra; Z. Hong et al., J. Virol., (1996),
supra].

In addition to HCV, all flavi- and pestiviruses 
sequenced to date contain conserved helicase sequence 
motifs in their homologous NS3 proteins, suggesting that 
this enzyme plays an important role in the HCV 
replicative cycle [R. H. Miller et al., Proc. Natl. Acad. 
Sci. USA , 87, pp. 2057-2061 (1990)]. Consistent with 
this possibility, helicase encoding sequences have been 
identified in other viruses and helicases are suggested 
to catalyze the separation of double-stranded nucleic 
acid structures during transcription and genome 
replication [G. Kadare et al., J. Virol., 71, pp. 2583-
2590 (1997)]. Previous studies with poliovirus, a
positive-stranded RNA virus of the Picornaviridae family,
show that mutation of conserved sequence motifs in the 2C
helicase inhibits virus replication and proliferation [C.
Mirzayan et al., Virology, 189, pp. 547-555 (1992)].
Similar mutational studies on the helicases encoded by 
herpes simplex virus type 1 and bovine papilloma virus 
also show that these enzymes are critical for virus 
replication [P. MacPherson et al., Virology, 204, pp.
403-408 (1994); R. Martinez et al., J. Virol., 66, pp.
6735-6746 (1992)]. Thus, the ability to inhibit helicase 
activity in HCV may provide an avenue for the therapeutic 
treatment of HCV infection. 

Unfortunately, little is known about the 
details of how ATP binding and hydrolysis lead to DNA or
RNA strand unwinding by the helicase. Two structures of
helicases crystallized in the absence of polynucleotide
have been reported but, unfortunately, they have not
yielded the critical information needed to extrapolate
to an enzyme mechanism
[N. Yao et al., Nat. Struct. Biol., 4, pp. 463-467
(1997); H. S. Subramanya et al., Nature, 384, pp. 379-383
(1996)].

Thus, there is a great need to solve the 
crystal structure of the helicase complexed with an 
oligonucleotide and, in particular, to delineate the 
oligonucleotide and nucleotide triphosphate (NTP) binding
pockets of that enzyme. With this information, computer
models of these binding sites can be created and
potential inhibitors of HCV helicase can be rationally
designed.

SUMMARY OF THE INVENTION 

Applicants have solved this problem by 
providing a crystallized complex of the NS3 helicase 
domain of HCV and a single stranded oligonucleotide. 
The structure of that crystal has been solved by X-ray
crystallography to a resolution of 2.2 Å. Based upon that
crystal structure, applicants have identified the key
amino acid residues that make up the oligonucleotide
binding pocket of the helicase.

Thus, the invention relates to a crystallized 
complex comprising the NS3 helicase domain of hepatitis C 
virus or mutants thereof and an oligonucleotide. 

The invention also relates to crystallizable 
compositions comprising the NS3 helicase domain, either 
as an isolated polypeptide or as part of the full length 
NS3 HCV protein, or single amino acid mutants thereof, 
and an oligonucleotide. It also relates to methods of
using such compositions to produce the aforementioned
crystals.

The invention also provides the X-ray structure
coordinates of an NS3 helicase-oligonucleotide complex.
Those coordinates, or at least the portion that defines
the oligonucleotide binding pocket or the NTP binding
pocket, are useful in methods for designing inhibitors of
the NS3 helicase, which in turn may be useful in treating
HCV infection. This is another aspect of the present
invention.

In a related aspect, the invention provides a 
computer programmed with the coordinates of the NS3 
helicase oligonucleotide binding pocket or the NTP 
binding pocket and with a program capable of converting 
those coordinates into a three-dimensional representation 
of the binding pocket on a display connected to the 
computer.

Finally, the invention provides a computer 
which, when programmed with at least a portion of the 
structural coordinates of HCV NS3 helicase and an X-ray 
diffraction data set of a different molecule or molecular 
complex, performs a Fourier transform of these structural
coordinates of the helicase and then
processes the X-ray diffraction data into structure 
coordinates of the different molecule or molecular 
complex via the process of molecular replacement. 
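
The Fourier transform step can be illustrated with a minimal sketch. It
assumes fractional atomic coordinates and, unless told otherwise, a unit
scattering factor for every atom; both are simplifications that are not
taken from this document, and the function name is hypothetical. The
function evaluates the standard structure factor sum
F(hkl) = sum_j f_j exp[2*pi*i*(h*x_j + k*y_j + l*z_j)].

```python
import numpy as np


def structure_factor(frac_coords, hkl, scattering_factors=None):
    """Return the complex structure factor F(hkl).

    frac_coords: (n_atoms, 3) fractional coordinates.
    hkl: Miller indices (h, k, l).
    scattering_factors: optional per-atom weights; defaults to 1.0 for
    every atom, which is an illustrative assumption only.
    """
    xyz = np.asarray(frac_coords, dtype=float)
    indices = np.asarray(hkl, dtype=float)
    if scattering_factors is None:
        f = np.ones(len(xyz))
    else:
        f = np.asarray(scattering_factors, dtype=float)
    # Phase angle 2*pi*(h*x + k*y + l*z) for each atom.
    phases = 2.0 * np.pi * (xyz @ indices)
    return np.sum(f * np.exp(1j * phases))
```

In a molecular replacement search, structure factors calculated from a
positioned search model in this way would be compared with the observed
diffraction amplitudes of the unknown crystal.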






BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 lists the atomic structure coordinates 
for NS3 helicase in complex with dU8 as derived by X-ray
diffraction from a crystal of that complex. The 
following abbreviations are used in Figure 1: 

"Atom type" refers to the element whose coordinates 
are measured. The first letter in the column defines the 
element.

"X, Y, Z" crystallographically define the atomic 
position of the element measured. 

"B" is a thermal factor that measures movement of 
the atom around its atomic center. 

"Occ" is an occupancy factor that refers to the 
fraction of the molecules in which each atom occupies the 
position specified by the coordinates. A value of "1" 
indicates that each atom has the same conformation, i.e., 
the same position, in all molecules of the crystal. 
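
As a rough illustration of how the "B" column can be read, the sketch
below converts an isotropic B value into a root mean square displacement
using the standard relation B = 8*pi^2*<u^2>. The function name is
hypothetical, and the relation assumes isotropic, harmonic motion; it is
not a statement about how the values in Figure 1 were refined.

```python
import math


def rms_displacement_from_b(b_factor):
    """Return sqrt(<u^2>) in angstroms for an isotropic B factor.

    b_factor is expected in square angstroms.
    """
    if b_factor < 0:
        raise ValueError("B factor must be non-negative")
    # Rearranged from B = 8 * pi^2 * <u^2>.
    mean_square_u = b_factor / (8.0 * math.pi ** 2)
    return math.sqrt(mean_square_u)
```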

Figure 2 shows a diagram of a computer used to 
generate a three-dimensional graphical representation of 
a molecule or molecular complex according to this 
invention.

Figure 3 shows a cross section of a magnetic 
storage medium. 



Figure 4 shows a cross section of an optically-
readable data storage medium.

Figure 5 depicts a stereo ribbon diagram of the 
overall fold of the NS3 helicase with bound d(U)8. 
Domain 1 is colored blue, domain 2 red, and domain 3 
green. The sulfate and DNA are colored yellow. 

Figure 6 depicts a superposition of domains 1 
(blue) and 2 (red) of NS3 helicase based on conserved 
secondary structure motifs with a 2.0 Å C-alpha RMS
deviation over 78 residues. Residues corresponding to 
binding to the 3' end of the oligonucleotide are depicted 
as thick lines. Also shown is the location of Trp-501. 

Figure 7, panel A, depicts the residues 
surrounding the bound sulfate superimposed on the 
phosphate binding loops of eight deoxynucleoside 
monophosphate kinases. Panel B depicts the electron 
density encompassing the bound DNA substrate. The orange 
color depicts the difference Fourier (Fo-Fc) electron
density map calculated before DNA or water molecules were
built into the model. The blue color depicts the final
2Fo-Fc electron density map calculated at 2.2 Å resolution
using the refined model. 

Figure 8 depicts the secondary structure of HCV 
NS3 helicase (indicated above the sequence). The
conserved sequence motifs are underscored. The non-HCV
N-terminal and C-terminal residues which were added 
during cloning are depicted in lower case. No density 
was observed for residues in italics. The residues are 
numbered based on their location in the NS3 protein. 

Figure 9, panel A, depicts aligned sequences of 
conserved motifs from other helicases. The motifs are 
colored similar to what was previously reported for the 
PcrA helicase to aid in comparison of the structures of 
enzymes from superfamilies I and II. Panel B depicts the
location of conserved motifs. 

Figure 10 depicts a view into the central 
binding cleft of the NS3 helicase domain. 

Figure 11, Panel A, depicts the effect of 
enzyme concentration and incubation time on HCV NS3 
helicase. Panel B depicts the effect of incubation 
temperature on helicase activity. Panel C depicts the 
effect of pH on helicase activity. Panel D depicts the 
effect of monovalent cations on helicase activity. Panel 
E depicts the effect of ATP on helicase activity. Panel F 
depicts the effect of divalent cations on enzyme 
activity. 

Figure 12, Panel A, depicts the effect of enzyme
concentration on the binding of [32P]-ssRNA substrate to
HCV NS3 helicase. Panel B depicts the dissociation of
pre-formed NS3 helicase/[32P]-labeled ssRNA complex by
[3H]-labeled ssRNA over time. Panel C depicts the effect
of pH on the binding of ssRNA to helicase. Panel D 
depicts the effect of monovalent cation on ssRNA binding 
to helicase. Panel E depicts the effect of divalent 
cations on ssRNA binding to helicase. 

Figure 13, Panel A, depicts the effect of poly(U)
on the ATPase activity of HCV NS3 helicase. Panel B
depicts the effect of enzyme concentration on ATPase
activity in the presence or absence of poly(U).

DETAILED DESCRIPTION OF THE INVENTION

In order that the invention described herein 
may be more fully understood, the following detailed 
description is set forth. 

According to one embodiment, the invention 
provides a crystallizable composition comprising NS3 
helicase and an oligonucleotide. 

The NS3 helicase protein in the crystallizable 
complexes of this invention is selected from the isolated 
helicase domain from any strain or the consensus sequence 
of the HCV NS3 protein (e.g., amino acids 167-631 of SEQ
ID NO: 2); the entire NS3 protein from any strain of HCV
or the consensus sequence of that protein (e.g., SEQ ID
NO: 2); any portion of the NS3 protein that contains a
functional helicase domain, which has been indicated to
be amino acids 183-582 by C. Hyun-Soo et al., J. Biol.
Chem., 273, pp. 15045-15052 (1998), from any strain of
HCV or the consensus sequence of that protein (e.g.,
amino acids 183-582 of SEQ ID NO: 2, amino acids 167-631
of SEQ ID NO: 2, amino acids 183-631 of SEQ ID NO: 2); and
amino acid mutants of any of the above (including, but
not limited to, SEQ ID NO: 2 or any portion thereof that
includes amino acids 183-582 of SEQ ID NO: 2 and contains
one or more of the following single amino acid
replacements: Ser231-to-Ala, Thr269-to-Ala, Ser370-to-
Ala, Thr411-to-Ala, Trp501-to-Phe, Trp501-to-Leu or
Trp501-to-Ala, Gln460-to-Ala, Arg461-to-Ala, Arg462-to-
Ala, Arg464-to-Ala, or Arg467-to-Ala).

The NS3 protein utilized in the crystallizable
compositions of this invention may also contain
additional amino acids at the N- and/or C-terminus which
may be useful in purifying the protein when produced
recombinantly. For example, we have found that a poly-
histidine tag at the C-terminus is useful in purifying
NS3 proteins produced in recombinant host cells through
the use of appropriate resins, such as Q-Sepharose
(Pharmacia, Piscataway, NJ). Such tags may also be
useful in increasing the solubility of the NS3 protein.

The second component in these compositions is 
an oligonucleotide. Preferably, the oligonucleotide is 
single-stranded, although double-stranded
oligonucleotides may be used and subsequently dissociated
prior to crystallization. Preferably, the
oligonucleotide is a polynucleotide of between about 6
and 20 bases. More preferably, the oligonucleotide is
between about 6 and 12 bases. Most preferably, the
oligonucleotide is polyuracil 8 nucleotides long (dU8).

The molar ratio of NS3 helicase to 
oligonucleotide should be about 1:1, although ranges 
between 1:5 and 5:1 are acceptable. 

The buffers and other reagents present in the 
crystallizable compositions of this application may be 
any components that promote crystallization and/or are 
compatible with crystallization conditions. An example 
of such a buffer condition is 15 mM MES (pH 6.5), 2.5 mM
β-mercaptoethanol.

The invention also relates to crystals of NS3 
helicase complexed with an oligonucleotide. Both the NS3 
helicase component and the oligonucleotide component are
the same as those described above for crystallizable
compositions. These crystals are obtained from the above 
described compositions by standard crystallization 
protocols, such as the protocol exemplified in the 
Example section below. 

The invention also relates to a method of 
making NS3 helicase-containing crystals. Such methods 
comprise the steps of: 

a) obtaining a crystallizable composition 
comprising an NS3 helicase protein and an oligonucleotide 
in a molar ratio of between 1:5 and 5:1; and 

b) subjecting said composition to conditions 
which promote crystallization. 

Again, the choices for the NS3 helicase protein
and the oligonucleotide utilized in the above
crystallization method are the same as those set forth
above for crystallizable compositions.

As mentioned above, applicants have solved the 
three-dimensional X-ray crystal structure of an NS3 
helicase-dU8 complex. The atomic coordinate data are
presented in Figure 1. 

In order to use the structure coordinates 
generated for the NS3 helicase-dU8 complex or its NTP or
oligonucleotide binding pockets or portions or homologues
thereof, it is often necessary to convert them into
a three-dimensional shape. This is achieved through the
use of commercially available software that is capable of
generating three-dimensional graphical representations of
molecules or portions thereof from a set of structure
coordinates.

Binding pockets, also referred to as binding 
sites in the present invention, are of significant 
utility in fields such as drug discovery. The 
association of natural ligands or substrates with the 
binding pockets of their corresponding receptors or 
enzymes is the basis of many biological mechanisms of 
action. Similarly, many drugs exert their biological 
effects through association with the binding pockets of 
receptors and enzymes. Such associations may occur with 
all or any parts of the binding pocket. An understanding 
of such associations will help lead to the design of 
drugs having more favorable associations with their 
target receptor or enzyme, and thus, improved biological 
effects. Therefore, this information is valuable in 
designing potential inhibitors of the binding sites of 
biologically important targets. 

The term "binding pocket", as used herein, 
refers to a region of a molecule or molecular complex
that, as a result of its shape, favorably associates with
another chemical entity or compound. 

Applicants have identified three binding 
pockets which are good targets for designing inhibitors. 
Two of these binding pockets reside in the region of the 
helicase where an oligonucleotide binds. These pockets 
are designated U4 and U8, based upon the nucleotide of dU8
that lies in this pocket in an NS3 helicase-dU8 complex.
The third binding pocket is the NTP binding pocket.
While this binding pocket has been partially described by
others [N. Yao et al., Nat. Struct. Biol., 4, pp. 463-467
(1997)], applicants have further defined this pocket in a 
way that was not derivable from what was known in the 
art. 

The terms "U4-, U8- and NTP-like binding 
pocket", as used herein, refer to a portion of a molecule 
or molecular complex whose shape is sufficiently similar 
to the NS3 helicase U4, U8 and NTP binding pocket, so as 
to bind common ligands. These commonalities of shape are
defined by a root mean square deviation from the
structure coordinates of the backbone atoms of the amino
acids that make up these binding pockets in the NS3
helicase structure (as set forth in Figure 1) of not more
than 1.5 Å. The method of performing this calculation is
described below. 

In resolving the crystal structure of NS3 
helicase in complex with an oligonucleotide, applicants 
have determined that NS3 amino acids Val232, Thr254, 
Gly255, Thr269, Gly271, Lys272, Ala275, Trp501 and Tyr502
form close contacts (<4 Å) with U8 of dU8 in the NS3
helicase-dU8 complex. Thus, a binding pocket defined by
the structural coordinates of those amino acids, as set
forth in Figure 1, or a binding pocket whose root mean
square deviation from the structure coordinates of the
backbone atoms of those amino acids is not more than 1.5 Å,
is considered a U8-like binding pocket of this invention.

Applicants have also determined that in 
addition to the NS3 amino acids set forth above, Pro230, 
Val256, Thr298, Ala497, Lys551, Gln552, Gly554, Glu555, 
Asn556 and Pro558 are within 8 Å of U8 of the bound
oligonucleotide and therefore are also close enough to
interact with that substrate. Thus, in a preferred
embodiment, a binding pocket defined by the structural
coordinates of the amino acids that are within 8 Å of U8
of the bound oligonucleotide, as set forth in Figure 1,
or a binding pocket whose root mean square deviation from
the structure coordinates of the backbone atoms of those
amino acids is not more than 1.5 Å, is considered a
preferred NS3 helicase U8-like binding pocket of this
invention.
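
The distance criteria used above (close contacts under 4 Å, and a wider
8 Å shell) can be sketched as a simple cutoff search. This is a minimal
sketch, not the procedure applicants used: the function name, the input
layout of (residue label, coordinates) tuples, and the choice of any
protein atom to any ligand atom as the distance measure are all
assumptions made for illustration.

```python
import numpy as np


def residues_within_cutoff(protein_atoms, ligand_coords, cutoff):
    """Return sorted residue labels with an atom within `cutoff` angstroms.

    protein_atoms: iterable of (residue_label, (x, y, z)) tuples.
    ligand_coords: iterable of (x, y, z) tuples, for example the atoms of
    a single bound nucleotide.
    """
    ligand = np.asarray(list(ligand_coords), dtype=float)
    close = set()
    for label, xyz in protein_atoms:
        # Distance from this protein atom to every ligand atom.
        distances = np.linalg.norm(ligand - np.asarray(xyz, dtype=float), axis=1)
        if distances.min() <= cutoff:
            close.add(label)
    return sorted(close)
```

Calling the function twice on the same coordinates, once with a cutoff of
4.0 and once with 8.0, would give the inner contact set and the larger
shell respectively.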

Applicants have further determined that the NS3 
helicase amino acids that define the shape of the U4 
oligonucleotide binding pocket are: His369, Ser370,
Lys371, Tyr392, Arg393, Thr411, Asp412, Ala413, Cys431,
Val432, Gln434, Ile446, Thr448, Arg461, Glu493, Glu555,
Asn556 and Phe557. Thus, a binding pocket defined by the
structural coordinates of these amino acids, as set forth
in Figure 1, or a binding pocket whose root mean square
deviation from the structure coordinates of the backbone
atoms of these amino acids is not more than 1.5 Å, is
considered an NS3 helicase U4-like binding pocket of this
invention.

Applicants have also more completely 
determined, as compared to the prior art, the NS3 
helicase amino acids that define the shape of the NTP 
binding pocket. Those amino acids are: Pro205, Thr206, 
Gly207, Ser208, Gly209, Lys210, Ser211, Thr212, Lys213, 
Asn229, Ala234, Gly237, Phe238, Tyr241, Asp290, Glu291, 
His293, Thr322, Ala323, Thr324, Gln460, Gly463, Arg464 
and Arg467. Thus, a binding pocket defined by the
structural coordinates of these amino acids, as set forth
in Figure 1, or a binding pocket whose root mean square
deviation from the structure coordinates of the backbone
atoms of these amino acids is not more than 1.5 Å, is
considered an NS3 helicase NTP-like binding pocket of this
invention.

It will be readily apparent to those of skill 
in the art that the numbering of amino acids in other 
isoforms of NS3 may be different from that set forth
herein. Corresponding amino acids in other isoforms of
NS3 are easily identified by visual inspection of the 
amino acid sequences or by using commercially available 
homology software programs. 
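
Once two NS3 sequences have been aligned, translating residue numbers
from one to the other is mechanical. The following is a minimal sketch
that assumes the alignment is already available as two equal-length
strings with "-" marking gaps; the function name and the starting-number
parameters are hypothetical and do not correspond to any particular
homology program.

```python
def map_residue_numbers(aligned_ref, aligned_other, ref_start=1, other_start=1):
    """Map residue numbers of a reference sequence onto another sequence.

    Both inputs are rows of one pairwise alignment (equal length, '-' for
    gaps). Returns {ref_number: other_number} for every alignment column
    in which neither row has a gap.
    """
    if len(aligned_ref) != len(aligned_other):
        raise ValueError("aligned sequences must have equal length")
    mapping = {}
    ref_no, other_no = ref_start, other_start
    for ref_char, other_char in zip(aligned_ref, aligned_other):
        if ref_char != "-" and other_char != "-":
            mapping[ref_no] = other_no
        # Advance each counter only when that row holds a real residue.
        if ref_char != "-":
            ref_no += 1
        if other_char != "-":
            other_no += 1
    return mapping
```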

Each of those amino acids of NS3 helicase is 
defined by a set of structure coordinates set forth in 
Figure 1. The term "structure coordinates" refers to 
Cartesian coordinates derived from mathematical equations 
related to the patterns obtained on diffraction of a 
monochromatic beam of X-rays by the atoms (scattering 
centers) of a protein or protein-ligand complex in 
crystal form. The diffraction data are used to calculate 
an electron density map of the repeating unit of the 
crystal. The electron density maps are then used to 
establish the positions of the individual atoms of the 
enzyme or enzyme complex.

Those of skill in the art understand that a set
of structure coordinates for an enzyme or an enzyme- 
complex or a portion thereof, is a relative set of points 
that define a shape in three dimensions. Thus, it is 
possible that an entirely different set of coordinates 
could define a similar or identical shape. Moreover, 
slight variations in the individual coordinates will have 
little effect on overall shape. In terms of binding 
pockets, these variations would not be expected to 
significantly alter the nature of ligands that could 
associate with those pockets. 

The term "associating with" refers to a 
condition of proximity between a chemical entity or 
compound, or portions thereof, and a binding pocket or 
binding site on a protein. The association may be non- 
covalent — wherein the juxtaposition is energetically 
favored by hydrogen bonding or van der Waals or 
electrostatic interactions — or it may be covalent. 

The variations in coordinates discussed above 
may be generated because of mathematical manipulations of 
the NS3 helicase-oligonucleotide complex structure 
coordinates. For example, the structure coordinates set 
forth in Figure 1 could be manipulated by 
crystallographic permutations of the structure
coordinates, fractionalization of the structure
coordinates, integer additions or subtractions to sets of 
the structure coordinates, inversion of the structure 
coordinates or any combination of the above. 
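
One way to see that such manipulations leave the shape unchanged is to
compare interatomic distances before and after. The sketch below is an
illustration under stated assumptions: both coordinate sets list the same
atoms in the same order, and the function names and tolerance are
hypothetical.

```python
import numpy as np


def pairwise_distances(coords):
    """Return the (n, n) matrix of distances between all atom pairs."""
    xyz = np.asarray(coords, dtype=float)
    diff = xyz[:, None, :] - xyz[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def same_shape(coords_a, coords_b, tol=1e-6):
    """True if two equally ordered coordinate sets share all interatomic distances."""
    dist_a = pairwise_distances(coords_a)
    dist_b = pairwise_distances(coords_b)
    return bool(np.allclose(dist_a, dist_b, atol=tol))
```

A constant shift of every atom, or inversion through the origin, passes
this check, whereas a uniform scaling does not. Note that a distance
comparison cannot distinguish a structure from its mirror image.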

Alternatively, modifications in the crystal 
structure due to mutations, additions, substitutions, 
and/or deletions of amino acids, or other changes in any 
of the components that make up the crystal could also
account for variations in structure coordinates. If such 
variations are within an acceptable standard error as 
compared to the original coordinates, the resulting 
three-dimensional shape is considered to be the same. 
Thus, for example, a ligand that bound to the 
oligonucleotide binding pocket of NS3 helicase would also 
be expected to bind to another binding pocket whose 
structure coordinates defined a shape that fell within 
the acceptable error. 

Various computational analyses are therefore 
necessary to determine whether a molecule or the binding 
pocket portion thereof is sufficiently similar to the NS3 
helicase binding pockets described above. Such analyses 
may be carried out in well known software applications, 
such as the Molecular Similarity application of QUANTA
(Molecular Simulations Inc., San Diego, CA) version 4.1,
and as described in the accompanying User's Guide. 

The Molecular Similarity application permits 
comparisons between different structures, different 
conformations of the same structure, and different parts 
of the same structure. The procedure used in Molecular 
Similarity to compare structures is divided into four 
steps: 1) load the structures to be compared; 2) define 
the atom equivalences in these structures; 3) perform a 
fitting operation; and 4) analyze the results. 

Each structure is identified by a name. One 
structure is identified as the target (i.e., the fixed 
structure); all remaining structures are working
structures (i.e., moving structures). Since atom
equivalency within QUANTA is defined by user input, for
the purpose of this invention we will define equivalent
atoms as protein backbone atoms (N, Cα, C and O) for all
conserved residues between the two structures being
compared. We will also consider only rigid fitting
operations.

When a rigid fitting method is used, the 
working structure is translated and rotated to obtain an 
optimum fit with the target structure. The fitting 
operation uses an algorithm that computes the optimum
translation and rotation to be applied to the moving
structure, such that the root mean square difference of
the fit over the specified pairs of equivalent atoms is an
absolute minimum. This number, given in angstroms, is
reported by QUANTA.
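
A rigid fit of this kind can be sketched with the Kabsch algorithm, a
widely used method for finding the rotation that minimizes the root mean
square difference between paired points. This is a stand-in chosen for
illustration; the document does not state which algorithm QUANTA uses,
and the function name and array layout are assumptions.

```python
import numpy as np


def rigid_fit_rmsd(target, working):
    """Superimpose `working` on `target` and return the minimum RMSD.

    Both inputs are (n_atoms, 3) arrays of equivalent atoms in the same
    order, for example the N, C-alpha, C and O atoms of conserved residues.
    """
    p = np.asarray(working, dtype=float)
    q = np.asarray(target, dtype=float)
    if p.shape != q.shape or p.ndim != 2 or p.shape[1] != 3:
        raise ValueError("expected two (n_atoms, 3) arrays with matching shapes")
    # Remove the translation by moving both centroids to the origin.
    p_centered = p - p.mean(axis=0)
    q_centered = q - q.mean(axis=0)
    # Covariance matrix and its singular value decomposition.
    covariance = p_centered.T @ q_centered
    u, _, vt = np.linalg.svd(covariance)
    # Guard against an improper rotation (a reflection).
    sign = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    rotation = vt.T @ np.diag([1.0, 1.0, sign]) @ u.T
    fitted = p_centered @ rotation.T
    squared = np.sum((fitted - q_centered) ** 2, axis=1)
    return float(np.sqrt(np.mean(squared)))
```

The returned value, in the same units as the input coordinates, is the
quantity that would be compared against a similarity threshold.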

For the purpose of this invention, any molecule 
or molecular complex or binding pocket thereof that has a 
root mean square deviation of conserved residue backbone
atoms (N, Cα, C, O) of less than 1.5 Å when superimposed
on the relevant backbone atoms described by structure
coordinates listed in Figure 1 is considered identical.
More preferably, the root mean square deviation is less
than 1.0 Å.

The term "root mean square deviation" means the 
square root of the arithmetic mean of the squares of the 
deviations from the mean. It is a way to express the 
deviation or variation from a trend or object. For 
purposes of this invention, the "root mean square 
deviation" defines the variation in the backbone of a 
protein from the backbone of NS3 helicase or a binding 
pocket portion thereof, as defined by the structure 
coordinates of NS3 helicase described herein. 
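
For paired atoms, this quantity is conventionally written as shown below.
The expression is the standard form for N equivalent atom pairs after
superposition and is offered as an assumption about what is meant here;
the symbols N, a_i and b_i do not appear in the original text. Here a_i
is the position of the i-th backbone atom in the NS3 helicase structure
and b_i is the position of the equivalent atom in the structure being
compared.

```latex
\mathrm{RMSD} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left\lVert \mathbf{a}_i - \mathbf{b}_i \right\rVert^{2}}
```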

Therefore, according to another embodiment of 
this invention, there is provided a computer for producing:

a) a three-dimensional representation of a molecule or
molecular complex, wherein said molecule or molecular 
complex comprises a binding pocket defined by 
structure coordinates of NS3 amino acids Val232, 
Thr254, Gly255, Thr269, Gly271, Lys272, Ala275, 
Trp501 and Tyr502 according to Figure 1; or

b) a three-dimensional representation of a homologue of 
said molecule or molecular complex, wherein said 
homologue comprises a binding pocket that has a root 
mean square deviation from the backbone atoms of said 
amino acids of not more than 1.5 Å, wherein said
computer comprises: 

(i) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises the structure 
coordinates of NS3 amino acids Val232, Thr254, Gly255, 
Thr269, Gly271, Lys272, Ala275, Trp501 and Tyr502
according to Figure 1; 

(ii) a working memory for storing instructions for 
processing said machine-readable data; 

(iii) a central-processing unit coupled to said 
working memory and to said machine-readable data storage 
medium for processing said machine readable data into 
said three-dimensional representation; and

(iv) a display coupled to said central-processing
unit for displaying said three-dimensional 
representation. 

According to a preferred embodiment, the 
computer produces a three-dimensional representation of: 

a) a molecule or molecular complex comprising a binding 
pocket defined by the structure coordinates of NS3 
helicase amino acids Val232, Thr254, Gly255, Thr269,
Gly271, Lys272, Ala275, Trp501, Tyr502, Pro230,
Val256, Thr298, Ala497, Lys551, Gln552, Gly554, 
Glu555, Asn556 and Pro558, according to Figure 1; or 

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket that 
has a root mean square deviation from the backbone
atoms of said amino acids of not more than 1.5Å.

In that preferred embodiment, the machine-readable data
comprises the structure coordinates of NS3 amino acids
Val232, Thr254, Gly255, Thr269, Gly271, Lys272,
Ala275, Trp501, Tyr502, Pro230, Val256, Thr298,
Ala497, Lys551, Gln552, Gly554, Glu555, Asn556 and
Pro558, according to Figure 1.

In the above two embodiments, the computer is 
producing a three-dimensional graphical structure of a 
molecule or a molecular complex which comprises a NS3
helicase U8-like binding pocket. 
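
The homologue criterion used in these embodiments (a root mean square deviation from the backbone atoms of not more than 1.5Å) can be illustrated with a short calculation. The following is a minimal sketch, not the procedure used to prepare Figure 1. It assumes the two sets of backbone atoms have already been paired atom-for-atom and superimposed in a common frame. The function names `rmsd` and `is_homologous_pocket` are hypothetical, and the coordinates are invented placeholders rather than values from Figure 1.

```python
import math

# Threshold taken from the text: backbone RMSD of not more than 1.5 angstroms.
RMSD_CUTOFF = 1.5


def rmsd(reference, candidate):
    """Root mean square deviation between two paired coordinate lists.

    Assumes both lists hold (x, y, z) tuples in angstroms, in the same
    atom order, and already superimposed in a common frame.
    """
    if len(reference) != len(candidate) or not reference:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(reference, candidate):
        total += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(total / len(reference))


def is_homologous_pocket(reference, candidate, cutoff=RMSD_CUTOFF):
    """True when the candidate pocket lies within the RMSD cutoff."""
    return rmsd(reference, candidate) <= cutoff


# Placeholder backbone coordinates (N, CA, C, O of one residue); illustrative only.
reference_backbone = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (2.0, 1.4, 0.0), (1.3, 2.4, 0.0)]
# The same atoms displaced by 1.0 angstrom along z.
candidate_backbone = [(x, y, z + 1.0) for x, y, z in reference_backbone]

print(round(rmsd(reference_backbone, candidate_backbone), 3))
print(is_homologous_pocket(reference_backbone, candidate_backbone))
```

In practice the superposition step would be performed first by a least-squares fitting routine; that step is omitted here to keep the sketch short.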

In an alternate embodiment, the computer 
produces a three-dimensional representation of: 

a) a molecule or molecular complex comprising a binding 
pocket defined by the structure coordinates of NS3 
helicase amino acids His369, Ser370, Lys371, Tyr392, 
Arg393, Thr411, Asp412, Ala413, Cys431, Val432, 
Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, Asn556 
and Phe557, according to Figure 1; or 

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket that 
has a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1.5Å.

In this alternate embodiment, the machine 
readable data comprises the structure coordinates of NS3 
amino acids His369, Ser370, Lys371, Tyr392, Arg393, 
Thr411, Asp412, Ala413, Cys431, Val432, Gln434, Ile446, 
Thr448, Arg461, Glu493, Glu555, Asn556 and Phe557, 
according to Figure 1. 

In this embodiment, the computer is producing a 
three-dimensional graphical structure of a molecule or a 
molecular complex which comprises a NS3 helicase U4-like 
binding pocket.

In yet another alternate embodiment, the
computer produces a three-dimensional representation of: 

a) a molecule or molecular complex comprising a binding 
pocket defined by the structure coordinates of NS3 
helicase amino acids Pro205, Thr206, Gly207, Ser208, 
Gly209, Lys210, Ser211, Thr212, Lys213, Asn229, 
Ala234, Gly237, Phe238, Tyr241, Asp290, Glu291, 
His293, Thr322, Ala323, Thr324, Gln460, Gly463, Arg464 
and Arg467, according to Figure 1; or

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket that 
has a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1.5Å.

In this alternate embodiment, the machine 
readable data comprises the structure coordinates of NS3 
amino acids Pro205, Thr206, Gly207, Ser208, Gly209, 
Lys210, Ser211, Thr212, Lys213, Asn229, Ala234, Gly237, 
Phe238, Tyr241, Asp290, Glu291, His293, Thr322, Ala323, 
Thr324, Gln460, Gly463, Arg464 and Arg467, according to 
Figure 1. 

In this embodiment, the computer is producing a 
three-dimensional graphical structure of a molecule or a 
molecular complex which comprises a NS3 helicase NTP-like 
binding pocket.

Even more preferred is a computer for producing
a three-dimensional representation of a molecule or 
molecular complex defined by structure coordinates of all 
of the NS3 amino acids set forth in Figure 1, or a three- 
dimensional representation of a homologue of said 
molecule or molecular complex, wherein said homologue 
comprises a binding pocket that has a root mean square 
deviation from the backbone atoms of said amino acids of 
not more than 1.5Å. In this embodiment, the machine-
readable data contains the coordinates of all of the NS3
amino acids set forth in Figure 1.

According to an alternate embodiment, the 
invention provides a computer for determining at least a 
portion of the structure coordinates corresponding to X- 
ray diffraction data obtained from a molecule or 
molecular complex, wherein said computer comprises: 

(a) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises at least a 
portion of the structural coordinates of NS3 helicase 
according to Figure 1; 

(b) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises X-ray 
diffraction data from said molecule or molecular complex;

(c) a working memory for storing instructions for
processing said machine-readable data of (a) and (b);

(d) a central-processing unit coupled to said 
working memory and to said machine-readable data storage 
medium of (a) and (b) for performing a Fourier transform 
of the machine readable data of (a) and for processing 
said machine readable data of (b) into structure 
coordinates; and 

(e) a display coupled to said central-processing 
unit for displaying said structure coordinates of said 
molecule or molecular complex. 

For example, the Fourier transform of the 
structure coordinates set forth in Figure 1 may be used 
to determine at least a portion of the structure 
coordinates of other helicases. 
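
The Fourier transform referred to above relates atomic coordinates to structure factors. The following is a minimal sketch of that relationship, assuming fractional coordinates and a constant scattering factor per atom (real scattering factors vary with resolution and include thermal motion). The function name `structure_factor` and the two-atom model are hypothetical and are not taken from Figure 1.

```python
import cmath
import math


def structure_factor(atoms, hkl):
    """F(hkl) = sum_j f_j * exp(2*pi*i*(h*x_j + k*y_j + l*z_j)).

    atoms: list of (f, x, y, z) with fractional coordinates x, y, z and a
    constant scattering factor f (a simplification for illustration).
    """
    h, k, l = hkl
    total = 0j
    for f, x, y, z in atoms:
        total += f * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
    return total


# Invented two-atom model in fractional coordinates; illustrative only.
model = [(6.0, 0.0, 0.0, 0.0), (8.0, 0.5, 0.5, 0.5)]

for hkl in [(0, 0, 0), (1, 0, 0), (1, 1, 0)]:
    F = structure_factor(model, hkl)
    # Report the amplitude and the phase (in degrees) of each reflection.
    print(hkl, round(abs(F), 3), round(math.degrees(cmath.phase(F)), 1))
```

Each structure factor has an amplitude and a phase. Only amplitudes are measured in a diffraction experiment, which is why calculated phases from a known structure are useful.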

Figure 2 demonstrates one version of these 
embodiments. System 10 includes a computer 11 comprising 
a central processing unit ("CPU") 20, a working memory 22 
which may be, e.g., RAM (random-access memory) or "core" 
memory, mass storage memory 24 (such as one or more disk 
drives or CD-ROM drives), one or more cathode-ray tube
("CRT") display terminals 26, one or more keyboards 28,
one or more input lines 30, and one or more output lines
40, all of which are interconnected by a conventional bi-
directional system bus 50.

Input hardware 36, coupled to computer 11 by 
input lines 30, may be implemented in a variety of ways. 
Machine-readable data of this invention may be inputted 
via the use of a modem or modems 32 connected by a 
telephone line or dedicated data line 34. Alternatively 
or additionally, the input hardware 36 may comprise CD-
ROM drives or disk drives 24. In conjunction with
display terminal 26, keyboard 28 may also be used as an 
input device. 

Output hardware 46, coupled to computer 11 by
output lines 40, may similarly be implemented by
conventional devices. By way of example, output hardware
46 may include CRT display terminal 26 for displaying a
graphical representation of a binding pocket of this 
invention using a program such as QUANTA as described 
herein. Output hardware might also include a printer 42, 
so that hard copy output may be produced, or a disk drive 
24, to store system output for later use. 

In operation, CPU 20 coordinates the use of the 
various input and output devices 36, 46, coordinates data
accesses from mass storage 24 and accesses to and from
working memory 22, and determines the sequence of data
processing steps. A number of programs may be used to
process the machine-readable data of this invention. 
Such programs are discussed in reference to the 
computational methods of drug discovery as described 
herein. Specific references to components of the 
hardware system 10 are included as appropriate throughout 
the following description of the data storage medium. 

Figure 3 shows a cross section of a magnetic
data storage medium 100 which can be encoded with
machine-readable data that can be processed by a system
such as system 10 of Figure 2. Medium 100 can be a 
conventional floppy diskette or hard disk, having a 
suitable substrate 101, which may be conventional, and a 
suitable coating 102, which may be conventional, on one 
or both sides, containing magnetic domains (not visible) 
whose polarity or orientation can be altered 
magnetically. Medium 100 may also have an opening (not 
shown) for receiving the spindle of a disk drive or other 
data storage device 24. 

The magnetic domains of coating 102 of medium 
100 are polarized or oriented so as to encode, in a manner
which may be conventional, machine-readable data such as
that described herein, for execution by a system such as 
system 10 of Figure 2.

Figure 4 shows a cross section of an optically-
readable data storage medium 110 which also can be
encoded with such machine-readable data, or set of
instructions, which can be carried out by a system such
as system 10 of Figure 2. Medium 110 can be a
conventional compact disk read only memory (CD-ROM) or a
rewritable medium such as a magneto-optical disk which is
optically readable and magneto-optically writable.
Medium 110 preferably has a suitable substrate 111, which
may be conventional, and a suitable coating 112, which
may be conventional, usually on one side of substrate
111.

In the case of CD-ROM, as is well known, 
coating 112 is reflective and is impressed with a 
plurality of pits 113 to encode the machine-readable 
data. The arrangement of pits is read by reflecting 
laser light off the surface of coating 112. A protective 
coating 114, which preferably is substantially 
transparent, is provided on top of coating 112.

In the case of a magneto-optical disk, as is 
well known, coating 112 has no pits 113, but has a 
plurality of magnetic domains whose polarity or 
orientation can be changed magnetically when heated above 
a certain temperature, as by a laser (not shown). The
orientation of the domains can be read by measuring the
polarization of laser light reflected from coating 112. 
The arrangement of the domains encodes the data as 
described above. 

Thus, in accordance with the present invention, 
X-ray coordinate data capable of being processed into a 
three dimensional graphical display of a molecule or 
molecular complex which comprises a NS3 helicase-like
binding pocket is stored in a machine-readable storage 
medium. 

The NS3 helicase X-ray coordinate data, when 
used in conjunction with a computer programmed with 
software to translate those coordinates into the 3- 
dimensional structure of a molecule or molecular complex 
comprising a NS3 helicase-like binding pocket, may be used
for a variety of purposes, such as drug discovery. 

For example, the structure encoded by the data 
may be computationally evaluated for its ability to 
associate with chemical entities. Chemical entities that 
associate with NS3 helicase may inhibit that enzyme, and 
are potential drug candidates. Alternatively, the 
structure encoded by the data may be displayed in a 
graphical three-dimensional representation on a computer 
screen. This allows visual inspection of the structure,
as well as visual inspection of the structure's
association with chemical entities.

Thus, according to another embodiment, the 
invention relates to a method for evaluating the 
potential of a chemical entity to associate with 

a) a molecule or molecular complex comprising a binding 
pocket defined by structure coordinates of NS3 
helicase amino acids Val232, Thr254, Gly255, Thr269, 
Gly271, Lys272, Ala275, Trp501 and Tyr502 according to
Figure 1, or 

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket that 
has a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1.5Å.

This method comprises the steps of: 

i) employing computational means to perform a fitting 
operation between the chemical entity and a binding 
pocket of the molecule or molecular complex; and 

ii) analyzing the results of said fitting operation to 
quantify the association between the chemical entity and 
the binding pocket. 
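
Steps i) and ii) can be sketched as a small rigid-body search. This is a toy illustration under stated assumptions, not any of the programs named in this specification: the ligand is only translated on a coarse grid (no rotation or flexibility), and the association is quantified with a generic 12-6 Lennard-Jones term whose parameters (`r_min`, `well_depth`) are hypothetical. The pocket and ligand coordinates are invented.

```python
import itertools
import math


def pair_energy(distance, r_min=3.5, well_depth=0.2):
    """12-6 Lennard-Jones energy (kcal/mol) with minimum -well_depth at r_min."""
    ratio = r_min / distance
    return well_depth * (ratio ** 12 - 2.0 * ratio ** 6)


def score(pocket_atoms, ligand_atoms):
    """Step ii): sum of pairwise energies between pocket and ligand atoms."""
    return sum(
        pair_energy(math.dist(p, q)) for p in pocket_atoms for q in ligand_atoms
    )


def fit(pocket_atoms, ligand_atoms, span=2.0, step=0.5):
    """Step i): try rigid translations of the ligand on a cubic grid.

    Returns (best_score, best_shift); lower scores mean a better fit.
    """
    n = int(round(span / step))
    offsets = [i * step for i in range(-n, n + 1)]
    best = (float("inf"), (0.0, 0.0, 0.0))
    for dx, dy, dz in itertools.product(offsets, repeat=3):
        moved = [(x + dx, y + dy, z + dz) for x, y, z in ligand_atoms]
        best = min(best, (score(pocket_atoms, moved), (dx, dy, dz)))
    return best


# Invented pocket: four atoms arranged around the origin (angstroms).
pocket = [(3.5, 0.0, 0.0), (-3.5, 0.0, 0.0), (0.0, 3.5, 0.0), (0.0, -3.5, 0.0)]
# Invented one-atom "chemical entity" starting off-center.
ligand = [(1.0, 0.5, 0.0)]

best_score, best_shift = fit(pocket, ligand)
print(best_shift, round(best_score, 3))
```

A grid search is used here only because it is easy to follow; the docking programs discussed later in this section use more elaborate search and scoring schemes.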

The term "chemical entity", as used herein, 
refers to chemical compounds, complexes of at least two 
chemical compounds, and fragments of such compounds or
complexes.

Preferably, the method evaluates the potential 
of a chemical entity to associate with 

a) a molecule or molecular complex comprising a binding 
pocket defined by structure coordinates of NS3 
helicase amino acids Val232, Thr254, Gly255, Thr269, 
Gly271, Lys272, Ala275, Trp501, Tyr502, Pro230,
Val256, Thr298, Ala497, Lys551, Gln552, Gly554, 
Glu555, Asn556 and Pro558 according to Figure 1, or 

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket 
that has a root mean square deviation from the 
backbone atoms of said amino acids of not more than 
1.5Å.

These embodiments relate to evaluating the potential of a 
chemical entity to associate with a NS3 helicase U8-like 
binding pocket. 

In an alternate embodiment, the same steps 
indicated above are used in a method for evaluating the 
potential of a chemical entity to associate with

a) a molecule or molecular complex comprising a binding
pocket defined by structure coordinates of NS3
helicase amino acids His369, Ser370, Lys371, Tyr392,
Arg393, Thr411, Asp412, Ala413, Cys431, Val432,
Gln434, Ile446, Thr448, Arg461, Glu493, Glu555,
Asn556 and Phe557 according to Figure 1, or

b) a homologue of said molecule or molecular complex,
wherein said homologue comprises a binding pocket
that has a root mean square deviation from the
backbone atoms of said amino acids of not more than
1.5Å.

These embodiments relate to evaluating the potential of a 
chemical entity to associate with a NS3 helicase U4-like 
binding pocket. 

In yet another alternate embodiment, the same 
steps indicated above are used in a method for evaluating 
the potential of a chemical entity to associate with 

a) a molecule or molecular complex comprising a binding 
pocket defined by structure coordinates of NS3 
helicase amino acids Pro205, Thr206, Gly207, Ser208, 
Gly209, Lys210, Ser211, Thr212, Lys213, Asn229, 
Ala234, Gly237, Phe238, Tyr241, Asp290, Glu291, 
His293, Thr322, Ala323, Thr324, Gln460, Gly463, 
Arg464 and Arg467 according to Figure 1, or

b) a homologue of said molecule or molecular complex, 
wherein said homologue comprises a binding pocket 
that has a root mean square deviation from the 
backbone atoms of said amino acids of not more than
1.5Å.

These embodiments relate to evaluating the potential of a 
chemical entity to associate with a NS3 helicase NTP-like 
binding pocket. 

Even more preferably, the method evaluates the 
potential of a chemical entity to associate with a 
molecule or molecular complex defined by structure 
coordinates of all of the NS3 helicase amino acids, as 
set forth in Figure 1, or a homologue of said molecule or 
molecular complex having a root mean square deviation 
from the backbone atoms of said amino acids of not more 
than 1.5Å.

Alternatively, the structural coordinates of
the NS3 helicase binding pocket can be utilized in a
method for identifying a potential agonist or antagonist
of a molecule comprising a NS3 helicase-like binding
pocket. This method comprises the steps of:

a. using the atomic coordinates of
Val232, Thr254, Gly255, Thr269, Gly271, Lys272, Ala275,
Trp501 and Tyr502 according to Figure 1 ± a root mean
square deviation from the backbone atoms of said amino
acids of not more than 1.5Å, to generate a
three-dimensional structure of a molecule comprising a NS3
helicase-like binding pocket;

b. employing said three-dimensional
structure to design or select said potential agonist or
antagonist;

c. synthesizing said agonist or
antagonist; and

d. contacting said agonist or antagonist
with said molecule to determine the ability of said
potential agonist or antagonist to interact with said
molecule.

More preferred is when the atomic coordinates 
of Val232, Thr254, Gly255, Thr269, Gly271, Lys272, 
Ala275, Trp501, Tyr502, Pro230, Val256, Thr298, Ala497,
Lys551, Gln552, Gly554, Glu555, Asn556 and Pro558
according to Figure 1 ± a root mean square deviation from
the backbone atoms of said amino acids of not more than
1.5Å, are used to generate a three-dimensional structure
of a molecule comprising a NS3 helicase-like binding
pocket.

These methods are designed to identify agonists 
and antagonists that associate with an NS3 helicase U8-
like binding pocket.

Alternatively, the atomic coordinates of the
NS3 helicase U4 binding pocket (His369, Ser370, Lys371,
Tyr392, Arg393, Thr411, Asp412, Ala413, Cys431, Val432,
Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, Asn556
and Phe557 according to Figure 1) ± a root mean square
deviation from the backbone atoms of said amino acids of
not more than 1.5Å, may be used in step a), above, to
generate a three-dimensional structure of a molecule
comprising a NS3 helicase-like binding pocket.

In another alternative embodiment, the atomic
coordinates of the NS3 helicase NTP binding site
(Pro205, Thr206, Gly207, Ser208, Gly209, Lys210, Ser211,
Thr212, Lys213, Asn229, Ala234, Gly237, Phe238, Tyr241,
Asp290, Glu291, His293, Thr322, Ala323, Thr324, Gln460,
Gly463, Arg464 and Arg467 according to Figure 1) ± a
root mean square deviation from the backbone atoms of
said amino acids of not more than 1.5Å, may be used in
step a), above, to generate a three-dimensional structure
of a molecule comprising a NS3 helicase-like binding
pocket.

Most preferred is when the atomic coordinates 
of all the amino acids of NS3 helicase according to 
Figure 1 ± a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1.5Å, are used
to generate a three-dimensional structure of a molecule
comprising a NS3 helicase-like binding pocket. 

For the first time, the present invention 
permits the use of molecular design techniques to 
identify, select and design chemical entities, including 
inhibitory compounds, capable of binding to NS3 helicase-
like binding pockets, in particular the oligonucleotide
binding pocket of NS3 helicase.

Applicants' elucidation of the U4 and U8
binding pockets in the oligonucleotide binding site and
an expanded elucidation of the NTP binding pocket on NS3
helicase provide the necessary information for designing
new chemical entities and compounds that may interact
with NS3 helicase-like binding pockets, in whole or in
part.

Throughout this section, discussions about the 
ability of an entity to bind to, associate with or 
inhibit a NS3 helicase-like binding pocket refer to
features of the entity alone. Assays to determine if a 
compound binds to NS3 helicase are well known in the art 
and are exemplified below. 

The design of compounds that bind to or inhibit 
NS3 helicase-like binding pockets according to this 
invention generally involves consideration of two 
factors. First, the entity must be capable of physically 
and structurally associating with parts or all of the NS3 
helicase-like binding pockets. Non-covalent molecular 
interactions important in this association include 
hydrogen bonding, van der Waals interactions, hydrophobic 
interactions and electrostatic interactions. 

Second, the entity must be able to assume a 
conformation that allows it to associate with the NS3 
helicase-like binding pocket directly. Although certain 
portions of the entity will not directly participate in 
these associations, those portions of the entity may 
still influence the overall conformation of the molecule. 
This, in turn, may have a significant impact on potency. 
Such conformational requirements include the overall 
three-dimensional structure and orientation of the 
chemical entity in relation to all or a portion of the 
binding pocket, or the spacing between functional groups 
of an entity comprising several chemical entities that
directly interact with the NS3 helicase-like binding 
pocket or homologues thereof. 

The potential inhibitory or binding effect of a 
chemical entity on a NS3 helicase-like binding pocket may 
be analyzed prior to its actual synthesis and testing by 
the use of computer modeling techniques. If the 
theoretical structure of the given entity suggests 
insufficient interaction and association between it and 
the NS3 helicase-like binding pocket, testing of the 
entity is obviated. However, if computer modeling 
indicates a strong interaction, the molecule may then be 
synthesized and tested for its ability to bind to a NS3 
helicase-like binding pocket. This may be achieved by
testing the ability of the molecule to inhibit NS3 
helicase using the assays described in Example 5. In 
this manner, synthesis of inoperative compounds may be 
avoided. 

A potential inhibitor of a NS3 helicase-like 
binding pocket may be computationally evaluated by means 
of a series of steps in which chemical entities or 
fragments are screened and selected for their ability to 
associate with the NS3 helicase-like binding pockets.

One skilled in the art may use one of several
methods to screen chemical entities or fragments for 
their ability to associate with a NS3 helicase-like 
binding pocket. This process may begin by visual 
inspection of, for example, a NS3 helicase-like binding 
pocket on the computer screen based on the NS3 helicase 
structure coordinates in Figure 1 or other coordinates 
which define a similar shape generated from the machine- 
readable storage medium. Selected fragments or chemical 
entities may then be positioned in a variety of 
orientations, or docked, within that binding pocket as 
defined supra. Docking may be accomplished using
software such as Quanta and Sybyl, followed by energy 
minimization and molecular dynamics with standard 
molecular mechanics force fields, such as CHARMM and 
AMBER. 

Specialized computer programs may also assist 
in the process of selecting fragments or chemical 
entities. These include: 

1. GRID (P. J. Goodford, "A Computational Procedure for
Determining Energetically Favorable Binding Sites on
Biologically Important Macromolecules", J. Med. Chem.,
28, pp. 849-857 (1985)). GRID is available from Oxford
University, Oxford, UK.

2. MCSS (A. Miranker et al., "Functionality Maps of
Binding Sites: A Multiple Copy Simultaneous Search
Method", Proteins: Structure, Function and Genetics,
11, pp. 29-34 (1991)). MCSS is available from Molecular
Simulations, San Diego, CA.

3. AUTODOCK (D. S. Goodsell et al., "Automated Docking 
of Substrates to Proteins by Simulated Annealing", 
Proteins: Structure, Function, and Genetics, 8, pp. 195-
202 (1990)). AUTODOCK is available from Scripps Research
Institute, La Jolla, CA. 

4. DOCK (I. D. Kuntz et al., "A Geometric Approach to 
Macromolecule-Ligand Interactions", J. Mol. Biol., 161,
pp. 269-288 (1982)). DOCK is available from University 
of California, San Francisco, CA. 

Once suitable chemical entities or fragments 
have been selected, they can be assembled into a single 
compound or complex. Assembly may be preceded by visual 
inspection of the relationship of the fragments to each 
other on the three-dimensional image displayed on a 
computer screen in relation to the structure coordinates 
of NS3 helicase. This would be followed by manual model 
building using software such as Quanta or Sybyl [Tripos
Associates, St. Louis, MO]. 

Useful programs to aid one of skill in the art 
in connecting the individual chemical entities or 
fragments include: 

1. CAVEAT (P. A. Bartlett et al., "CAVEAT: A Program to
Facilitate the Structure-Derived Design of Biologically
Active Molecules", in Molecular Recognition in Chemical
and Biological Problems, Special Pub., Royal Chem. Soc.,
78, pp. 182-196 (1989); G. Lauri and P. A. Bartlett,
"CAVEAT: a Program to Facilitate the Design of Organic
Molecules", J. Comput. Aided Mol. Des., 8, pp. 51-66
(1994)). CAVEAT is available from the University of
California, Berkeley, CA.

2. 3D Database systems such as ISIS (MDL Information 
Systems, San Leandro, CA). This area is reviewed in Y.
C. Martin, "3D Database Searching in Drug Design", J.
Med. Chem., 35, pp. 2145-2154 (1992).

3. HOOK (M. B. Eisen et al., "HOOK: A Program for
Finding Novel Molecular Architectures that Satisfy the
Chemical and Steric Requirements of a Macromolecule
Binding Site", Proteins: Struct., Funct., Genet., 19,
pp. 199-221 (1994)). HOOK is available from Molecular
Simulations, San Diego, CA.

Instead of proceeding to build an inhibitor of 
a NS3 helicase-like binding pocket in a step-wise fashion 
one fragment or chemical entity at a time as described 
above, inhibitory or other NS3 helicase binding compounds 
may be designed as a whole or "de novo" using either an 
empty binding site or optionally including some 
portion(s) of a known inhibitor(s). There are many de
novo ligand design methods including: 

1. LUDI (H.-J. Bohm, "The Computer Program LUDI: A New 
Method for the De Novo Design of Enzyme Inhibitors", J. 
Comp. Aid. Molec. Design, 6, pp. 61-78 (1992)). LUDI is
available from Molecular Simulations Incorporated, San 
Diego, CA. 

2. LEGEND (Y. Nishibata et al., Tetrahedron, 47, p.
8985 (1991)). LEGEND is available from Molecular 
Simulations Incorporated, San Diego, CA. 

3. LeapFrog (available from Tripos Associates, St.
Louis, MO).

4. SPROUT (V. Gillet et al., "SPROUT: A Program for
Structure Generation", J. Comput. Aided Mol. Design, 7,
pp. 127-153 (1993)). SPROUT is available from the
University of Leeds, UK.

Other molecular modeling techniques may also be 
employed in accordance with this invention [see, e.g., 
N. C. Cohen et al., "Molecular Modeling Software and
Methods for Medicinal Chemistry", J. Med. Chem., 33, pp.
883-894 (1990); see also, M. A. Navia and M. A. Murcko,
"The Use of Structural Information in Drug Design",
Current Opinions in Structural Biology, 2, pp. 202-210
(1992); L. M. Balbes et al., "A Perspective of Modern
Methods in Computer-Aided Drug Design", in Reviews in
Computational Chemistry, Vol. 5, K. B. Lipkowitz and D.
B. Boyd, Eds., VCH, New York, pp. 337-380 (1994); see
also, W. C. Guida, "Software For Structure-Based Drug
Design", Curr. Opin. Struct. Biology, 4, pp. 777-781
(1994)].

Once a compound has been designed or selected 
by the above methods, the efficiency with which that 
entity may bind to an NS3 helicase binding pocket may be 
tested and optimized by computational evaluation. For 
example, an effective NS3 helicase binding pocket 
inhibitor must preferably demonstrate a relatively small 
difference in energy between its bound and free states 
(i.e., a small deformation energy of binding). Thus, the 
most efficient NS3 helicase binding pocket inhibitors
should preferably be designed with a deformation energy 
of binding of not greater than about 10 kcal/mole, more 
preferably, not greater than 7 kcal/mole. NS3 helicase 
binding pocket inhibitors may interact with the binding 
pocket in more than one conformation that is similar in 
overall binding energy. In those cases, the deformation 
energy of binding is taken to be the difference between 
the energy of the free entity and the average energy of 
the conformations observed when the inhibitor binds to 
the protein. 
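
The deformation energy criterion described above reduces to simple arithmetic once the energies are known. The sketch below assumes the energies (in kcal/mole) have already been computed by a molecular mechanics or quantum chemistry program, and it takes the sign convention "average bound energy minus free energy" as an assumption, since the text specifies only a difference. The function names and the numerical values are hypothetical.

```python
from statistics import mean


def deformation_energy(free_energy, bound_energies):
    """Average energy of the bound conformations minus the free-state energy.

    Energies are in kcal/mole; bound_energies lists one value per
    conformation observed when the entity binds to the protein.
    """
    return mean(bound_energies) - free_energy


def classify(value):
    """Compare a deformation energy with the thresholds given in the text."""
    if value <= 7.0:
        return "more preferred (not greater than 7 kcal/mole)"
    if value <= 10.0:
        return "preferred (not greater than about 10 kcal/mole)"
    return "outside preferred range"


# Invented energies for illustration only.
free = -42.0
bound = [-36.0, -34.5, -35.5]

d_e = deformation_energy(free, bound)
print(round(d_e, 2), classify(d_e))
```

When only one bound conformation is observed, `bound_energies` holds a single value and the average is that value.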

An entity designed or selected as binding to an 
NS3 helicase binding pocket may be further 
computationally optimized so that in its bound state it 
would preferably lack repulsive electrostatic interaction 
with the target enzyme and with the surrounding water 
molecules. Such non-complementary electrostatic 
interactions include repulsive charge-charge, dipole- 
dipole and charge-dipole interactions. 
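
A check for repulsive charge-charge contacts can be sketched with Coulomb's law. This is an illustrative simplification, not the treatment used by the programs listed below: atoms are modeled as point charges, dipole terms and solvent are ignored, and the dielectric constant of 4.0 and the 6.0Å cutoff are assumed values. The atom labels, charges and coordinates are invented.

```python
import math

# Conversion factor giving kcal/mol for charges in units of e and distances in angstroms.
COULOMB_CONSTANT = 332.06


def coulomb_energy(q1, q2, distance, dielectric=4.0):
    """Point-charge interaction energy in kcal/mol (dielectric is an assumption)."""
    return COULOMB_CONSTANT * q1 * q2 / (dielectric * distance)


def repulsive_pairs(ligand, protein, cutoff=6.0):
    """List (ligand_label, protein_label, energy) for like-charge contacts.

    Each atom is (label, charge, (x, y, z)); only pairs closer than
    cutoff angstroms are considered.
    """
    found = []
    for l_name, l_q, l_xyz in ligand:
        for p_name, p_q, p_xyz in protein:
            r = math.dist(l_xyz, p_xyz)
            if r < cutoff:
                energy = coulomb_energy(l_q, p_q, r)
                if energy > 0.0:
                    found.append((l_name, p_name, round(energy, 2)))
    return found


# Invented partial charges and positions; illustrative only.
ligand_atoms = [("N1", 0.5, (0.0, 0.0, 0.0)), ("O1", -0.5, (3.0, 0.0, 0.0))]
protein_atoms = [("POS", 1.0, (0.0, 4.0, 0.0)), ("NEG", -1.0, (6.0, 4.0, 0.0))]

for contact in repulsive_pairs(ligand_atoms, protein_atoms):
    print(contact)
```

An optimization step would then modify or reposition the entity so that this list becomes empty, or so that the flagged energies become small.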

Specific computer software is available in the 
art to evaluate compound deformation energy and 
electrostatic interactions. Examples of programs 
designed for such uses include: Gaussian 94, revision C 
(M. J. Frisch, Gaussian, Inc., Pittsburgh, PA ©1995);
AMBER, version 4.1 (P. A. Kollman, University of
California at San Francisco, ©1995); QUANTA/CHARMM
(Molecular Simulations, Inc., San Diego, CA ©1995);
Insight II/Discover (Molecular Simulations, Inc., San
Diego, CA ©1995); DelPhi (Molecular Simulations, Inc.,
San Diego, CA ©1995); and AMSOL (Quantum Chemistry
Program Exchange, Indiana University). These programs
may be implemented, for instance, using a Silicon
Graphics workstation such as an Indigo2 with "IMPACT"
graphics. Other hardware systems and software packages 
will be known to those skilled in the art. 

Another approach enabled by this invention is
the computational screening of small molecule databases 
for chemical entities or compounds that can bind in 
whole, or in part, to a NS3 helicase binding pocket. In 
this screening, the quality of fit of such entities to 
the binding site may be judged either by shape 
complementarity or by estimated interaction energy [E. C. 
Meng et al., J. Comp. Chem., 13, pp. 505-524 (1992)].

According to another embodiment, the invention 
provides compounds which associate with a NS3 helicase- 
like binding pocket produced or identified by the method 
set forth above.

The structure coordinates set forth in Figure 1
can also be used to aid in obtaining structural 
information about another crystallized molecule or 
molecular complex. This may be achieved by any of a 
number of well-known techniques, including molecular 
replacement.

Therefore, in another embodiment this invention 
provides a method of utilizing molecular replacement to 
obtain structural information about a molecule or 
molecular complex whose structure is unknown comprising 
the steps of: 

a) crystallizing said molecule or molecular
complex of unknown structure;

b) generating X-ray diffraction data from said
crystallized molecule or molecular complex; and

c) applying at least a portion of the structure 
coordinates set forth in Figure 1 to the X-ray 
diffraction data to generate a three-dimensional electron 
density map of the molecule or molecular complex whose 
structure is unknown. 

By using molecular replacement, all or part of 
the structure coordinates of the NS3 helicase/ 
oligonucleotide complex as provided by this invention 
(and set forth in Figure 1) can be used to determine the 
structure of a crystallized molecule or molecular complex
whose structure is unknown more quickly and efficiently
than attempting to determine such information ab initio.

Molecular replacement provides an accurate 
estimation of the phases for an unknown structure. 
Phases are a factor in equations used to solve crystal 
structures that cannot be determined directly.
Obtaining accurate values for the phases, by methods 
other than molecular replacement, is a time-consuming 
process that involves iterative cycles of approximations 
and refinements and greatly hinders the solution of 
crystal structures. However, when the crystal structure 
of a protein containing at least a homologous portion has 
been solved, the phases from the known structure provide
a satisfactory estimate of the phases for the unknown
structure.

Thus, this method involves generating a 
preliminary model of a molecule or molecular complex 
whose structure coordinates are unknown, by orienting and 
positioning the relevant portion of the NS3 
helicase/oligonucleotide complex according to Figure 1 
within the unit cell of the crystal of the unknown 
molecule or molecular complex so as best to account for 
the observed X-ray diffraction data of the crystal of the 
molecule or molecular complex whose structure is unknown. 
Phases can then be calculated from this model and 
combined with the observed X-ray diffraction data 
amplitudes to generate an electron density map of the 
structure whose coordinates are unknown. This, in turn, 
can be subjected to any well-known model building and 
structure refinement techniques to provide a final, 
accurate structure of the unknown crystallized molecule 
or molecular complex [E. Lattman, "Use of the Rotation 
and Translation Functions", in Meth. Enzymol., 115, pp. 
55-77 (1985); M. G. Rossmann, ed., "The Molecular 
Replacement Method", Int. Sci. Rev. Ser., No. 13, Gordon 
& Breach, New York (1972)]. 
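
The phase-combination step described above may be illustrated by the following minimal Python sketch. It is offered for illustration only and is not the procedure used to obtain the coordinates of Figure 1. Structure factors are calculated from a positioned search model, only their phases are kept, and each phase is paired with an observed amplitude. The point-atom weights, fractional coordinates and amplitude values are hypothetical, and the function names are introduced here for illustration.

```python
import cmath
import math


def model_structure_factor(hkl, atoms):
    """Calculated structure factor F(hkl) for point atoms.

    atoms: list of (weight, x, y, z) with fractional coordinates.
    The constant per-atom weight is a simplifying assumption; real
    programs use resolution-dependent scattering factors and B-factors.
    """
    h, k, l = hkl
    return sum(
        w * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
        for w, x, y, z in atoms
    )


def combine_with_model_phases(observed_amplitudes, atoms):
    """Pair each observed amplitude |Fo| with the phase of the model Fc."""
    combined = {}
    for hkl, f_obs in observed_amplitudes.items():
        phase = cmath.phase(model_structure_factor(hkl, atoms))
        combined[hkl] = cmath.rect(f_obs, phase)
    return combined


# Hypothetical two-atom model and three observed amplitudes.
model = [(6.0, 0.10, 0.20, 0.30), (8.0, 0.40, 0.15, 0.75)]
f_obs = {(1, 0, 0): 9.5, (0, 1, 0): 12.0, (1, 1, 1): 4.2}
phased = combine_with_model_phases(f_obs, model)
```

The resulting complex coefficients are the kind of input from which an electron density map would be computed by Fourier synthesis; that synthesis is not shown here.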

The structure of any portion of any 
crystallized molecule or molecular complex that is 
sufficiently homologous to any portion of the NS3 
helicase/oligonucleotide complex can be resolved by this 
method. 

In a preferred embodiment, the method of 
molecular replacement is utilized to obtain structural 
information about another helicase. The structure 
coordinates of NS3 helicase as provided by this invention 
are particularly useful in solving the structure of other 
isoforms of NS3 helicase or other NS3 helicase-containing 
complexes. 

Furthermore, the structure coordinates of NS3 
helicase as provided by this invention are useful in 
solving the structure of NS3 helicase proteins that have 
amino acid substitutions, additions and/or deletions 
(referred to collectively as "NS3 helicase mutants"), as 
compared to naturally occurring NS3 helicase isoforms. 
These NS3 helicase mutants may optionally be crystallized 
in co-complex with a chemical entity, such as a non- 
hydrolyzable NTP analogue or an oligonucleotide. The 
crystal structures of a series of such complexes may then 
be solved by molecular replacement and compared with that 
of wild-type NS3 helicase. Potential sites for 
modification within the various binding sites of the 
enzyme may thus be identified. This information provides 
an additional tool for determining the most efficient 
binding interactions, for example, increased hydrophobic 
interactions, between NS3 helicase and a chemical entity 
or compound. 

The structure coordinates are also particularly 
useful to solve the structure of crystals of NS3 helicase 
or NS3 helicase homologues co-complexed with a variety of 
chemical entities. This approach enables the 
determination of the optimal sites for interaction 
between NS3 helicase and chemical entities, including 
candidate NS3 helicase inhibitors. For example, 
high resolution X-ray diffraction data collected from 
crystals exposed to different types of solvent allow the 
determination of where each type of solvent molecule 
resides. Small molecules that bind tightly to those 
sites can then be designed and synthesized and tested for 
their NS3 helicase inhibition activity. 

All of the complexes referred to above may be 
studied using well-known X-ray diffraction techniques and 
may be refined versus 1.5-3 Å resolution X-ray data to an 
R value of about 0.20 or less using computer software, 
such as X-PLOR [Yale University, ©1992, distributed by 
Molecular Simulations, Inc.; see, e.g., Blundell & 
Johnson, supra; Meth. Enzymol., vol. 114 & 115, H. W. 
Wyckoff et al., eds., Academic Press (1985)]. This 
information may thus be used to optimize known NS3 
helicase inhibitors, and more importantly, to design new 
NS3 helicase inhibitors. 
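
For orientation, the R value referred to above is conventionally the sum of the absolute differences between observed and calculated structure factor amplitudes divided by the sum of the observed amplitudes. The following Python sketch shows that calculation under the assumption that the two amplitude lists are already on a common scale; the function name and the three example amplitudes are hypothetical and are not data from this invention.

```python
def r_value(f_obs, f_calc):
    """Conventional crystallographic R value.

    R = sum(| |Fo| - |Fc| |) / sum(|Fo|), assuming the calculated
    amplitudes have already been scaled to the observed ones.
    """
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("amplitude lists must be non-empty and equal in length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator


# Hypothetical amplitudes for three reflections.
example_r = r_value([100.0, 50.0, 25.0], [90.0, 55.0, 30.0])
```

A model would meet the criterion stated above when this quantity, evaluated over the measured reflections, is about 0.20 or less.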

In order that this invention be more fully 
understood, the following examples are set forth. These 
examples are for the purpose of illustration only and are 
not to be construed as limiting the scope of the 
invention in any way. 



EXAMPLE 1 

Cloning and Expression of NS3 Helicase 
The HCV NS3 RNA helicase domain (encoded by 
nucleotides 502-1896 of SEQ ID NO: 6) was subcloned from 
cDNA of the HCV H strain [A. Grakoui et al., J. Virol., 
67, pp. 1385-95 (1993); C. Lin et al., J. Virol., 68, pp. 
8147-57 (1994), the disclosures of which are herein 
incorporated by reference] into a pET expression vector 
(Novagen, Madison, WI). The resulting plasmid, 
pET-BS(+)/HCV/NS3-C465-His (SEQ ID NO: 4), also contained a 
methionine start codon, a linker-encoded Gly-Ser-Gly-Ser 
sequence attached to the C-terminal threonine of the NS3 
helicase domain and a six-histidine tag fused to the C- 
terminus of the Gly-Ser-Gly-Ser sequence to facilitate 
protein purification. This plasmid was used as a 
template for single-stranded DNA-based site-directed 
mutagenesis essentially as described [T. A. Kunkel, 
Proc. Natl. Acad. Sci. USA, 82, pp. 488-492 (1985) and C. 
Lin et al., Virology, 192, pp. 596-604 (1993), the 
disclosures of which are herein incorporated by 
reference] with the following modifications. 
The single stranded phagemid DNA packaged in 
the presence of helper M13 phage corresponds to the HCV 
plus strand. A single colony of E. coli strain CJ 326, 
transformed with pET-BS(+)/HCV/NS3-C465-His, was grown in 
YT media containing 0.25 µg/ml of uridine and 50 µg/ml of 
carbenicillin. After three serial passages, M13 helper 
phage (Bio-Rad) was used to rescue uridylated phagemid 
single stranded DNA, which was then used as template for 
oligonucleotide-directed mutagenesis [T. A. Kunkel (1985), 
supra]. ABI automatic sequencing was used to identify 
mutations and to ensure that there were no other unintended 
mutations within the HCV NS3 helicase domain sequences. 
Constructs containing mutations were named according to 
the position of the substituted residue in the full- 
length HCV NS3 protein. 
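
The naming convention just described (for example, a name of the form Ser231-to-Ala, numbered according to full-length NS3) can be sketched in Python as follows. This is an illustrative helper only: the function names, the four-residue toy sequence and its assumed starting residue number are hypothetical and are not taken from any SEQ ID NO of this application.

```python
import re

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def parse_mutation(name):
    """Split a name such as 'Ser231-to-Ala' into ('Ser', 231, 'Ala')."""
    match = re.fullmatch(r"([A-Z][a-z]{2})(\d+)-to-([A-Z][a-z]{2})", name)
    if match is None:
        raise ValueError(f"unrecognized mutation name: {name}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_mutation(sequence, first_residue_number, name):
    """Apply a named substitution to a one-letter sequence fragment.

    first_residue_number is the full-length NS3 number of sequence[0].
    """
    wild_type, position, replacement = parse_mutation(name)
    index = position - first_residue_number
    if not 0 <= index < len(sequence):
        raise ValueError("position lies outside the supplied fragment")
    if sequence[index] != THREE_TO_ONE[wild_type]:
        raise ValueError("fragment does not carry the stated wild-type residue")
    return sequence[:index] + THREE_TO_ONE[replacement] + sequence[index + 1:]


# Toy fragment assumed to start at residue 230 (hypothetical numbering).
mutant_fragment = apply_mutation("GSWA", 230, "Ser231-to-Ala")
```

Checking the stated wild-type residue before substituting guards against an offset error between construct numbering and full-length NS3 numbering.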

In this manner, we made NS3 helicase 
corresponding to the consensus sequence of the HCV 
genotype 1 (Pro at amino acid 332; Ser at amino acid 403; 
Ala at amino acid 410; and Thr at amino acid 505; 
hereinafter referred to as "wild type"); as well as NS3 
helicase containing the following single amino acid 
mutations as compared to the consensus HCV genotype 1 NS3 
helicase sequence: Ser231-to-Ala; Thr269-to-Ala; Ser370- 
to-Ala; Thr411-to-Ala; Trp501-to-Phe; Trp501-to-Leu; and 
Trp501-to-Ala. 

E. coli BL21(DE3) cells, freshly transformed 
with the pET-BS(+)/HCV/NS3-C465-His plasmid or similar 
plasmids encoding the single amino acid mutant NS3 
helicases described above, were grown at 30°C in LB media 
supplemented with 50 µg/ml of carbenicillin. When the 
density reached an OD600 of 1.0, the cells were induced 
for 3 hr at 30°C by the addition of IPTG to a final 
concentration of 0.8 mM. After induction, the cells were 
harvested and stored frozen at -70°C until purification. 

All the protein purification procedures were 
performed at 4°C. Typically, 10 g of cell paste was 
resuspended in 50 ml of buffer A [50 mM HEPES (pH 8), 300 
mM NaCl, 10% glycerol, and 2.5 mM β-mercaptoethanol] 
containing 0.2 mM phenylmethylsulfonyl fluoride (PMSF), 
and lysed using a microfluidizer. The lysate was 
clarified by centrifugation at 100,000 x g for 35 min. 
We then added 5 mM imidazole (pH 8) to the supernatant 
and the resulting solution was incubated for 2 hours with 
2 ml of Ni-NTA-agarose (Qiagen, Chatsworth, CA). 

The resin was packed into a column and washed 
with 10 bed volumes of buffer A containing 5 mM and 15 mM 
imidazole, and eluted with buffer A containing 100 mM 
imidazole. The eluant was desalted into buffer B [50 mM 
HEPES (pH 8), 10% glycerol, and 2.5 mM β- 
mercaptoethanol] containing 50 mM NaCl on a PD-10 column 
(Pharmacia). The desalted solution was loaded onto a 
Heparin-Sepharose column (Pharmacia). The flow-through 
was then applied onto a Q-Sepharose column (Pharmacia), 
which was washed with 10 bed volumes of buffer B containing 50 
mM NaCl. The column was then eluted with a NaCl gradient 
from 50 mM to 2 M in buffer B. 

The peak fraction containing the HCV NS3 
helicase domain protein was shown by gel-filtration 
chromatography to be monomeric. The purified protein was 
judged to be greater than 90% pure by sodium dodecyl 
sulfate (SDS) polyacrylamide gel electrophoresis (PAGE) 
and Coomassie R-250 staining. 

For crystallization studies, the protein was 
concentrated to 10 mg/ml by ultrafiltration and slowly 
diluted with 5 volumes of 15 mM MES (pH 6.5), 2.5 mM β- 
mercaptoethanol and again concentrated to 10 mg/ml. The 
dilution step was then repeated with 2 volumes of the MES 
buffer and the protein was concentrated to 13 mg/ml. We then added 
oligonucleotide (dU8; Oligo Therapeutics, Inc., 
Wilsonville, OR) to yield a 1:1 molar ratio of protein 
to nucleic acid. 
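
By way of a worked illustration of the 1:1 molar ratio, the following Python sketch converts a protein concentration in mg/ml into the oligonucleotide concentration needed for a chosen molar ratio. The molecular weights used in the example (50,000 for the protein and 2,400 for the oligonucleotide) are round placeholder values assumed for illustration; they are not stated in this application, and the function name is likewise hypothetical.

```python
def oligo_mg_per_ml(protein_mg_per_ml, protein_mw, oligo_mw, oligo_per_protein=1.0):
    """Oligonucleotide concentration (mg/ml) for a given molar ratio.

    mg/ml is numerically equal to g/L, so dividing by a molecular
    weight in g/mol gives a molar concentration in mol/L.
    """
    protein_molar = protein_mg_per_ml / protein_mw
    return protein_molar * oligo_per_protein * oligo_mw


# 13 mg/ml protein; both molecular weights are assumed placeholder values.
needed = oligo_mg_per_ml(13.0, protein_mw=50_000.0, oligo_mw=2_400.0)
```

With these placeholder weights the protein is at 0.26 mM, so an equimolar amount of oligonucleotide corresponds to roughly 0.62 mg/ml.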

To produce recombinant full-length NS3 protein, 
which was subsequently used for mutagenesis studies in 
the NTP binding pocket, we followed procedures similar to 
those above. The full-length NS3 coding sequence was also 
subcloned from the HCV H strain. It was placed in a pET 
expression vector to create the plasmid 
pET-BS(+)/HCV/FLNS3-His (SEQ ID NO: 3). As with the 
helicase constructs previously described, the full-length 
NS3 coding sequence was preceded by a methionine start 
codon and had codons encoding Gly-Ser-Gly-Ser-His6 in 
frame at the C-terminus of the NS3 coding region. That 
plasmid was used as a template for single-stranded DNA- 
based site-directed mutagenesis as described above. 

In this manner, we made NS3 containing the 
following single amino acid mutations as compared to the 
consensus HCV genotype 1 NS3 helicase sequence: 
Gln460->Ala, Arg461->Ala, Arg462->Ala, Arg464->Ala and 
Arg467->Ala. 

Both the wild-type full-length NS3 and the 
single amino acid mutants were purified as described 
above. 

EXAMPLE 2 



Crystallization and Data Collection 
Crystals of the NS3 helicase:dU8 complex were 
grown by hanging-drop vapor diffusion over wells 
containing 0.1 M Tris pH 8.0, 0.2 M Li2SO4, 18% 
polyethylene glycol 6000, and 8 mM β-mercaptoethanol. 
Drops were macroseeded within 12 hours after being set 
up. Crystals grew over the course of 2-3 weeks to 
dimensions of 0.4 x 0.4 x 0.1 mm. The crystals belong 
to space group P2₁2₁2 with unit cell dimensions a=73.1 Å, 
b=117.5 Å, c=63.4 Å, and contain one helicase:dU8 complex 
per asymmetric unit. 
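
As a simple consistency check on the cell parameters given above, the volume of an orthorhombic unit cell is the product of its three edges, and the crystal volume per unit of molecular weight (the Matthews coefficient) follows from the number of asymmetric units per cell, which is four for a primitive orthorhombic space group of this type. The Python sketch below performs both calculations. The molecular weight of 50,000 for the complex is a placeholder assumed for illustration and is not a value stated in this application.

```python
def orthorhombic_cell_volume(a, b, c):
    """Unit cell volume in cubic angstroms (all cell angles are 90 degrees)."""
    return a * b * c


def matthews_coefficient(cell_volume, asymmetric_units_per_cell, mol_weight):
    """Crystal volume per dalton of macromolecule (cubic angstroms per Da)."""
    return cell_volume / (asymmetric_units_per_cell * mol_weight)


# Cell edges from the text; 4 asymmetric units per cell for P2(1)2(1)2.
volume = orthorhombic_cell_volume(73.1, 117.5, 63.4)
# The molecular weight is an assumed placeholder, not a measured value.
v_m = matthews_coefficient(volume, 4, 50_000.0)
```

Under the assumed molecular weight this gives a coefficient of about 2.7 cubic angstroms per dalton, which is compatible with one complex per asymmetric unit.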

Heavy atom soaks were carried out by 
transferring crystals to a solution containing 0.1 M Tris 
pH 8.0, 0.2 M Li2SO4, 17% polyethylene glycol 6000, 8 mM 
β-mercaptoethanol, in addition to the heavy atom in 
question. Heavy atom soaks with K2WO4 were performed in 
the absence of Li2SO4. 

Crystals were transferred to a solution 
containing 0.08 M Tris pH 8.0, 0.2 M Li2SO4, 16% 
polyethylene glycol 6000, 8 mM β-mercaptoethanol, and 15% 
glycerol and immediately frozen in a dry nitrogen gas 
stream at 100 K (Molecular Structure Corp., Houston, TX) 
for data collection. 

Data were acquired by oscillation photography on 
a Rigaku R-AXIS IIC phosphor imaging area detector 
mounted on a Rigaku RU200 rotating anode generator (MSC), 
operating at 50 kV and 100 mA. Measured intensities were 
integrated, scaled, and merged using the HKL software 
package [Z. Otwinowski et al., Meth. Enzymol., 276, pp. 
307-326 (1997)]. 

EXAMPLE 3 

Phasing, Model Building and Refinement 
Heavy atom positions were located from 
difference Patterson and anomalous difference Patterson 
maps and confirmed with difference Fourier syntheses. 
Heavy atom parameters were refined and phases computed to 
2.3 Å using the program PHASES [W. Furey et al., Meth. 
Enzymol., 277, pp. 590-620 (1997)]. MIR 
phases were improved by cycles of solvent flattening [B. 
C. Wang, Methods Enzymol., 115, pp. 90-112 (1985)] 
combined with histogram matching [K. Y. J. Zhang et al., 
Acta Crystallogr., A46, pp. 377-381 (1990)] using the CCP4 
crystallographic package [Collaborative Computational 
Project, Number 4, Acta Crystallogr., D50, pp. 760-763 
(1994)]. 

Model building was carried out using QUANTA96 
(Molecular Simulations), and all refinement was done in 
X-PLOR [A. T. Brunger, "X-PLOR: A System for X-Ray 
Crystallography and NMR," Yale University Press, New 
Haven, Connecticut (1993)], using the free R-value [A. 
T. Brunger, Nature, 355, pp. 472-475 (1992)] to monitor 
the course of refinement. The current model, refined 
using data from 6.0-2.2 Å, consists of NS3 helicase 
residues 190-414 and 418-626, residues 3-8 of dU8, 1 
bound sulfate ion, and 159 well-ordered water molecules. 
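
The free R-value used above to monitor refinement is computed on a subset of reflections that is set aside and never used in refinement, while the conventional R value is computed on the remaining working set. A minimal Python sketch of that bookkeeping follows. The 10% test-set fraction, the random seed, the function names and the synthetic amplitude pairs are all assumptions made for illustration and do not reproduce the X-PLOR implementation or the data of this example.

```python
import random


def r_factor(reflections):
    """R = sum(| |Fo| - |Fc| |) / sum(|Fo|) over (Fo, Fc) amplitude pairs."""
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in reflections)
    denominator = sum(abs(fo) for fo, _ in reflections)
    return numerator / denominator


def split_free_set(reflections, free_fraction=0.1, seed=0):
    """Randomly reserve a fraction of reflections as the free (test) set."""
    rng = random.Random(seed)
    shuffled = list(reflections)
    rng.shuffle(shuffled)
    n_free = max(1, round(len(shuffled) * free_fraction))
    return shuffled[n_free:], shuffled[:n_free]


# Synthetic (Fo, Fc) pairs; each calculated amplitude is off by 5 units.
reflections = [(100.0 + i, 100.0 + i + (5.0 if i % 2 else -5.0)) for i in range(20)]
work_set, free_set = split_free_set(reflections)
r_work = r_factor(work_set)
r_free = r_factor(free_set)
```

In practice only the working set would be passed to the refinement program, and a free R-value that fails to fall along with the working R value would suggest overfitting.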

EXAMPLE 4 

Structural Features of the NS3 Helicase-dU8 Complex 
The structure of the resolved portion of HCV 
NS3 helicase (NS3 residues 189-626 of SEQ ID NO:1, 
corresponding to HCV polyprotein residues 1215-1652) 
complexed with a deoxyuridine octamer (dU8) was 
determined by multiple isomorphous replacement combined 
with anomalous scattering. The protein consists of three 
domains separated by a series of clefts (Figure 5). 

Domains 1 and 3 share a more extensive 
interface than either shares with domain 2. This 
interface is largely accounted for by packing of helices 
α5 and α6 from domain 3 on helix α4 from domain 1. As a 
result, the clefts between domains 1 and 2 and domains 2 
and 3 are the largest. A published crystal structure of 
HCV NS3 helicase domain demonstrated that domain 2 could 
undergo rigid body movements relative to domains 1 and 3 
based on a comparison of two crystallographically 
independent molecules [N. Yao et al., Nat. Struct. Biol., 
4, pp. 463-467 (1997)]. Preliminary structural studies 
on HCV helicase in a different crystal form also show a 
rotation of domain 2 relative to the first and third, 
confirming that this domain is flexibly linked to the 
other two. 

Domains 1 and 2 of the HCV helicase, which 
contain all of the conserved helicase sequence motifs, 
have similar topologies (Figure 6) and are also similar 
to domains 1A and 2A of the four domain PcrA and Rep DNA 
helicases [H. S. Subramanya et al., Nature, 384, pp. 379- 
383 (1996); S. Korolev et al., Cell, 90, pp. 635-647 
(1997)]. The structurally homologous domains of PcrA, 
Rep, and HCV helicases each contain a parallel six- 
stranded β-sheet flanked by α-helices. In addition, 
domain 1 of HCV helicase contains a seventh β-strand 
running antiparallel to the rest of the sheet. 

Superposition of domains 1 and 2 yields an rms 
deviation of 2.0 Å for 76 C-alpha atoms that form the 
core of each domain. Domain 3 is predominantly α-helical 
and is associated with domain 2 by a pair of anti- 
parallel β-strands (Figure 5). An interesting component 
of domain 3 is a 40 amino acid region preceding the C- 
terminal α-helix that lacks secondary structure elements. 
This may represent a flexible region of the protein that 
allows the C-terminus of NS3 to reach the active site of 
its own serine protease domain to facilitate cleavage at 
the NS3/NS4A junction during HCV polyprotein processing. 
This cleavage is believed to occur in cis [R. 
Bartenschlager et al., J. Virol., 68, pp. 5045-5055 
(1994)]. 
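
The rms deviation quoted above for the superposition of domains 1 and 2 is the square root of the mean squared distance between paired C-alpha atoms after the two domains have been overlaid. The Python sketch below computes that quantity for coordinate sets that are assumed to be already superimposed; the least-squares fitting step itself is not shown. The function name and the two-atom example coordinates are hypothetical and are not taken from Figure 1.

```python
import math


def rms_deviation(coords_a, coords_b):
    """Root-mean-square distance between paired atoms (angstroms).

    Both arguments are equal-length lists of (x, y, z) tuples that are
    assumed to have been superimposed beforehand.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(total / len(coords_a))


# Two hypothetical atom pairs, each separated by exactly 2 angstroms.
example_rmsd = rms_deviation(
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 2.0), (1.0, 2.0, 0.0)],
)
```

For the comparison described in the text, each list would hold the 76 core C-alpha positions of one domain.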

The N-terminal region of domain 1 contains a 
phosphate binding loop that is highly conserved among all 
helicases and commonly referred to as the Walker A box or 
motif I [J. E. Walker et al., EMBO J., 1, pp. 945-951 
(1982)]. In the structure presented here, this loop 
contains a bound sulfate ion (Figure 7A). This phosphate 
binding loop is structurally similar to those found in a 
number of other ATPases [M. Saraste et al., Trends 
Biochem. Sci., 15, pp. 430-434 (1990)]. The sulfate ion is 
stabilized by hydrogen bonds from the amide nitrogens of 
Gly-207 and Gly-209, and the side chains of Ser-208, Lys- 
210, and Ser-211. Lys-210 makes an additional water- 
mediated contact to the conserved Asp-290 of the DECH 
motif (motif II or Walker motif B). 

The side chains of Asp-290 and Glu-291, the 
most conserved residues in the DECH motif, point toward 
an open area beneath the phosphate binding loop that 
presumably is occupied by Mg2+ and the γ-phosphate of the 
bound Mg2+-NTP substrate. Cys-292 is buried in the 
protein interior while the His-293 side chain points into 
the cleft between domains 1 and 2. The position of the 
sulfate in this structure appears to be very similar to 
that of the β-phosphate of ADP in the PcrA helicase:ADP 
complex [H. S. Subramanya et al., Nature, (1996), supra]. 
It is therefore likely that this sulfate ion occupies the 
position of the β-phosphate when NTP or NDP is bound to 
the HCV helicase. 

Highly conserved residues Gln-460, Arg-464, and 
Arg-467 from domain 2 are solvent exposed in the 
interdomain cleft, while Arg-461 and Arg-462 are buried 
in domain 2 and stabilized by internal salt bridges and 
hydrogen bonds. The position of Arg-461 contrasts with that 
described in the structure of the apo HCV NS3 helicase, 
which reported this side chain as being solvent exposed 
and interacting with the phosphate backbone of single 
stranded nucleic acid modeled into this cleft [N. Yao et 
al., Nat. Struct. Biol., (1997), supra]. 

Protein-single stranded DNA interactions 

Studies of HCV NS3 helicase single stranded 
nucleic acid binding have demonstrated that poly(dU) 
binds to the helicase with higher affinity than poly(rU) 
of the same length [F. Preugschat et al., J. Biol. Chem., 
271 (1996), supra]. Extrapolation of these data suggested 
that a deoxyoligonucleotide 8mer might be long enough to 
bind to the helicase with high affinity and not interfere 
with protein-protein contacts during crystallization. 
Therefore an oligo dU8 was used for complex 
crystallization. In the structure presented here the 
first two residues of the oligonucleotide are disordered 
and have not been included in the model. The sugar- 
phosphate backbone of the third nucleotide is well 
represented in electron density maps (Figure 7B) while 
density for the base is extremely weak. Electron density 
for residues 4-8 is very well defined for the sugar- 
phosphate backbone and slightly weaker for the bases. 
Preliminary studies with dU10 and dU12 oligonucleotides 
show essentially the same electron density for the DNA. 

The bound single stranded DNA ("ssDNA") lies in 
a channel approximately 16 angstroms in diameter that 
separates domain 3 from domains 1 and 2 (Figure 10). The 
5' end of the oligonucleotide resides at the interface of 
domains 2 and 3 and its 3' end at the interface of 
domains 1 and 3. This orientation of the DNA is roughly 
perpendicular to that of the ssRNA in a model derived 
from the apo HCV helicase structure, in which the ssRNA 
was placed in the cleft between domains 1 and 2 [N. Yao 
et al., Nat. Struct. Biol., (1997), supra]. It is, 
however, consistent with the oligonucleotide binding site 
in the Rep helicase:DNA structure [S. Korolev et al., 
Cell, (1997), supra]. 

Interactions between the ssDNA and enzyme are 
mostly confined to the DNA backbone, as would be expected 
for a nonspecific protein-nucleic acid complex, and are 
concentrated at the two ends of the oligonucleotide. 
Protein contacts emanate mostly from loops between 
secondary structural elements in domains 1 and 2 (Figures 
6A, B). Interestingly, these contacts arise from 
symmetrically equivalent residues in these two domains, 
so that protein contacts to the dU4 and dU5 backbone 
phosphates are nearly identical to those to the dU7 and 
dU8 phosphates. 

At the 3' end of the DNA the dU8 phosphate is 
stabilized by a hydrogen bond with Thr-269 Oγ, which in 
turn accepts a hydrogen bond from the main chain NH of 
Lys-272, and by a hydrogen bond to the main chain NH of 
Gly-255. Equivalent contacts to the dU5 phosphate are 
made by the Arg-393 main chain NH and Thr-411 Oγ, which 
accepts a hydrogen bond from the Ala-413 NH. The dU7 
phosphate accepts a hydrogen bond from the Val-232 NH and 
interacts with the Ala-233 NH and Ser-231 Oγ via a 
bridging water molecule. The direct and water mediated 
main chain interactions are duplicated by Lys-371 and 
Lys-372 from domain 2 to the dU4 phosphate. Ser-370, the 
equivalent residue in domain 2 to Ser-231, makes a water 
mediated contact to the dU3 phosphate rather than dU4. 
Superposition of domains 1 and 2 of HCV helicase reveals 
that the residues involved in phosphate contacts are 
structurally equivalent (Figure 6). This was an 
unanticipated finding given the poor sequence homology 
between these two domains. Additionally, the four 
residues interacting with the phosphate backbone, Ser- 
231, Thr-269, Ser-370, and Thr-411, are absolutely 
conserved in all HCV NS3 sequences known to date. These 
findings suggest that these two domains may have arisen 
from a gene duplication event. 

Residues dU4-dU8 are capped by interactions at 
each end with hydrophobic side chains. Trp-501 stacks 
with the base of dU8 while Val-432 interacts with the dU4 
base (Figure 10). These two side chains act as a pair of 
bookends, defining a central binding cavity occupied by 
five nucleotides. Both Val-432 and Trp-501 are highly 
conserved among HCV NS3 sequences but neither has been 
implicated in nucleic acid binding or duplex unwinding. 
The Val-432:dU4 base interaction induces significant 
rotation about the phosphate backbone between dU3 and dU4 
such that the bases are completely unstacked (Figure 10). 
Stacking of Trp-501 with the dU8 base should similarly 
necessitate a large rotation about the phosphate of the 
following nucleotide. The resulting conformation of the 
DNA could be stabilized by phosphate interactions with 
Arg-253 and Lys-272 from domain 1 and Lys-372 and Lys-373 
from domain 2, which lie outside the central binding 
cavity. 

Domain 2 contains a pair of extended anti- 
parallel strands encompassing residues 430-452, which are 
involved in binding the 5' end of the oligonucleotide 
(Figure 5). Two other single-strand polynucleotide 
binding proteins, SSB and tRNA synthetase, contain anti- 
parallel strands extending from their protein core that 
are thought to make up the nucleic acid binding site [S. 
Raghunathan et al., Proc. Natl. Acad. Sci. USA, 94, pp. 
6652-6657 (1997); M. Ruff et al., Science, 252, pp. 1682- 
1689 (1991)]. This region is termed the L45 loop in this 
class of nucleic acid binding proteins. In the HCV 
helicase structure the oligonucleotide binds in a channel 
spanning two protein domains in a manner roughly similar 
to that seen for replication protein A (RPA) [A. 
Bochkarev et al., Nature, 385, pp. 176-181 (1997)]. In 
both structures, the oligonucleotide is most tightly 
bound at the 3' and 5' ends with few contacts with the 
central nucleotides. RPA also contains an L45 loop, 
which binds to the 5' end of the oligonucleotide. 

The five residues occupying the central binding 
cavity of HCV helicase adopt a conformation reminiscent 
of the central base pairs of DNA in the TBP-TATA box 
complex structures [Y. Kim et al., Nature, 365, pp. 512- 
520 (1993); J. L. Kim et al., Nature, 365, pp. 520-527 
(1993)]. In both instances the DNA is underwound 
considerably and the backbone smoothly bent, compressing 
the edges of the bases. Comparison of the DNA structure 
here with the central pyrimidine stretch of the TATA box 
DNA reveals that this DNA is more underwound than that 
seen in the TBP-TATA box complex. 

Our structure of the helicase:dU8 complex does 
not offer a ready explanation as to why the enzyme binds 
to poly(dU) with higher affinity than to other 
homopolymer DNAs [F. Preugschat et al., J. Biol. Chem., 
271 (1996), supra]. Sequence specific interactions with 
the DNA bases are not observed within the central binding 
cavity of the helicase. Differences in DNA binding 
affinity between different sequences in this case may be 
a result of differences in energetics of DNA distortion 
and base stacking rather than base-specific hydrogen 
bonding patterns. 

Location of conserved sequence motifs 

There is very high sequence conservation among 
various HCV strains in the NS3 RNA helicase domain with 
>80% sequence identity over the entire 456 amino acid 
polypeptide. The most highly conserved segments of these 
domains correspond to the canonical helicase sequence 
motifs (Figures 8, 9A) [A. Gorbalenya et al., Curr. Opin. 
Struct. Biol., 3, pp. 419-429 (1993)]. In the three- 
dimensional structure residues from these motifs form the 
interface between the first two domains (Figure 9B). 
Visual inspection of the structures of the PcrA DNA 
helicase from Bacillus stearothermophilus [H. S. 
Subramanya et al., Nature, (1996), supra] and E. coli Rep 
DNA helicase [S. Korolev et al., Cell (1997), supra; 
coordinates not available] suggests overall structural 
similarity between domains 1A and 2A of these DNA 
helicases and domains 1 and 2 of the HCV helicase. The 
locations of the conserved DNA helicase sequence motifs 
overlap with those of the HCV helicase allowing an 
unambiguous alignment of these motifs. Mutagenesis of 
individual residues within these motifs in HCV helicase 
or in other RNA helicases has demonstrated that they are 
essential for enzyme activity. The individual phenotypes 
of these mutants can now be more fully explained using 
the enzyme structure. 

Domain 1 of the HCV helicase has a fold similar 
to that found in a number of adenosine triphosphate 
transphosphorylases, such as adenylate and thymidine 
kinases. In particular, the phosphate binding loop 
formed by motif I (GSGKT) is virtually identical to the 
corresponding loop in the kinases. In these kinases this 
loop is involved in binding the β phosphate of ATP. HCV 
helicase has a sulfate bound in this exact location 
(Figure 7A). Mutation of residues corresponding to HCV 
helicase Lys-210 in other helicases invariably leads to 
inactivation [J. W. George et al., J. Mol. Biol., 235, 
pp. 424-435 (1994); T. W. Seeley et al., J. Biol. Chem., 265, 
pp. 7158-7165 (1990)]. 

Motif II (DExH) is proximal to the GSGKT 
phosphate binding loop and is expected to be involved in 
binding the Mg2+-ATP substrate. In adenylate and 
thymidine kinases, a conserved aspartate binds Mg2+, 
which helps orient the ATP for nucleophilic attack [M. E. 
Black et al., J. Biol. Chem., 267, pp. 6801-6806 (1992); 
H. G. Yan et al., Biochemistry, 30, pp. 5539-5546 
(1991)]. Mutation of the equivalent aspartate residue in 
these kinases or in other helicases inactivates ATP 
hydrolysis [M. E. Black et al., J. Biol. Chem., (1992), 
supra; C. H. Gross et al., J. Virol., 69, pp. 4727-4736 
(1995); R. M. Brosh Jr., et al., J. Bacteriol., 177, pp. 
5612-562 (1995); A. Pause et al., EMBO J., 11, pp. 2643- 
2654 (1992)]. His-293 is located at the bottom of the 
interdomain cleft and approximately 4 Å away from Val-456 
and Gln-460. This histidine appears to be essential for 
coupling the ATPase activity to polynucleotide binding; 
mutations of this histidine in HCV NS3 and vaccinia NPH- 
II helicases result in a functional ATPase with no 
helicase activity [C. H. Gross et al., J. Virol., (1995), 
supra; G. M. Heilek et al., J. Virol., 71, pp. 6264-6266 
(1997)]. Unfortunately, the structure presented here 
does not provide an obvious explanation as to how this 
residue couples the NTPase and unwinding activities. 

Studies in several helicases have looked at the 
effects of mutations in motif VI (QRxGRxGR), yet a role 
for this motif has not been clearly defined. In the HCV 
helicase residues in this motif are located in the 1:2 
interdomain cleft. Gln-460 lies at the bottom of the 
cleft opposite from His-293. Mutation of the 
corresponding glutamine in vaccinia virus helicase and in 
eIF-4A leads to significant decreases in ATPase activity 
[A. Pause et al., EMBO J., (1992), supra; C. H. Gross et 
al., J. Virol., 70, pp. 1706-1713 (1996)]. There are 
three conserved arginines in motif VI that were proposed 
by N. Yao et al., Nat. Struct. Biol. (1997), supra, to be 
involved in binding single-stranded RNA in the cleft 
between domains 1 and 2. Our structure of the 
helicase:dU8 complex demonstrates that this 
interpretation is unlikely to be correct. 

Here Arg-461 points away from the cleft and is 
hydrogen-bonded to Asp-412 and Asp-427. Mutation of this 
residue in a vaccinia virus helicase leads to decreases 
in RNA binding [C. H. Gross et al., J. Virol., (1996), 
supra], possibly as a consequence of alterations in the 
conformation of Asp-412 which lines the polynucleotide 
binding channel. Arginines 464 and 467 extend into the 
interdomain cleft, directly across from the presumed 
locations of the γ and α phosphates of ATP. These 
residues appear to be poised to contact the ATP 
phosphates upon closure of this interdomain cleft. This 
would be similar to the function of conserved basic 
residues in the second domain of adenylate kinase. 

Consistent with the possibility that Arg-464 
and Arg-467 are directly involved in ATP binding, 
mutations of the corresponding residues to Ala or Gln in 
vaccinia NPH-II or eIF-4A reduce the ATPase activity to 
<20% of wild type levels [C. H. Gross et al., J. Virol., 
(1996), supra; A. Pause et al., Mol. Cell. Biol., 13, pp. 
6789-6798 (1993)]. Arg-467 appears to be conserved among 
all three helicase superfamilies (Figure 9A) [A. 
Gorbalenya et al., Curr. Opin. Struct. Biol. (1993), 
supra]. 

Motif III connects domains 1 and 2, and appears 
to be a flexible linker [N. Yao et al., Nat. Struct. 
Biol. (1997), supra]. Motif Ia forms part of the β-sheet 
core of domain 1, but also extends to the 
oligonucleotide. Residues in motif V both contact the 
oligonucleotide and line the interface between the first 
two domains. In particular, Thr-411 makes a hydrogen 
bond to the phosphate of dU3 of the oligonucleotide. 

The current structure lacks any region 
corresponding to motif IV in Rep and PcrA helicases [H. 
S. Subramanya et al., Nature (1996), supra; S. Korolev et 
al., Cell, (1997), supra], members of the superfamily I 
class of helicases. Previous sequence alignments that 
found similarities within motif IV between superfamily I 
and II helicases were done with rather weak criteria and 
may not have been significant. In the DNA helicases, 
motif IV is responsible for binding the adenine ring of 
ATP [H. S. Subramanya et al., Nature (1996), supra; S. 
Korolev et al., Cell, (1997), supra]. Mutation of a 
conserved arginine in this motif in UvrD increases the 
ATP Km by 37-fold [M. C. Hall et al., J. Biol. Chem., 
272, pp. 18614-18620 (1997)]. 

In HCV helicase either another protein segment 
which is not in the current structure substitutes for 
motif IV or the adenosine ring binds elsewhere. Residues 
from the putative motif IV in HCV helicase include Ser- 
370 and Lys-371, which contact the DNA via a water- 
mediated hydrogen bond and a backbone interaction, 
respectively. Therefore sequences corresponding to motif 
IV in superfamily I and superfamily II helicases occupy 
different regions and appear to have different functions. 
We suggest that a new motif, designated IVa, be used to 
describe residues corresponding to the putative HCV 
helicase motif IV. In E. coli UvrD, motif IVa may 
correspond to the sequence RSNAQSRVL (residues 355-363). 

Proposed domain closure and translocation 

Conserved, basic residues from motif VI are 
positioned across the interdomain cleft from the expected 
location of the ATP γ phosphate in HCV helicase. A very 
similar situation is observed in the structures of the 
adenylate kinases, where basic residues lie across a 
cleft from the ATP binding site. Binding of ATP (or 
analogs) to these kinases leads to a conformational 
change in the enzyme, resulting in the burial of 
previously solvent-exposed phosphates [T. Bilderback et 
al., Biochemistry, 35, pp. 6100-6106 (1996)]. Mutation 
of these conserved basic residues in adenylate kinase 
results in an open structure with poor catalytic activity 
[G. E. Schulz, Faraday Discuss., 93, pp. 85-93 (1992)].

We propose that an analogous closure occurs 
between domains 1 and 2 of HCV helicase upon ATP binding. 
This closure could be driven by interaction of basic 
residues in motif VI with the ATP phosphates. Sequence 
and structural conservation of these basic residues in 
motif VI among superfamily I and II helicases suggests 
that domain closure upon ATP binding is a general feature 
of these enzymes. 

Gln-460 and His-293, from motifs VI and II 
respectively, lie on opposite sides of the interdomain 
cleft and possibly serve as gatekeepers, altering the 
equilibrium between the open and closed forms based on 
the binding of polynucleotide. Potential interaction of 
residues in these positions was predicted by the 
observation that helicases with the DExH motif II 
sequence usually contain a glutamine in motif VI, whereas 
those with a DEAD sequence contain a histidine [A. 
Gorbalenya et al., Curr. Opin. Struct. Biol. (1993), 
supra].

There is structural evidence that the linkage 
of the second domain in HCV helicase to the rest of the 
protein is flexible. In the HCV helicase structure 
reported by N. Yao et al., Nat. Struct. Biol. (1997), 
supra, the differences between the two molecules in the 
asymmetric unit can be attributed to a rotation of domain 
2 by a few degrees. The relatively minor movement of 
domain 2 observed in their structures probably reflects 
changes in the local environment in the crystals. We 
propose that a much more substantial conformational change 
would occur when the enzyme binds ATP or a suitable analog. 
Our crystallographic results indicate that there are 
significant movements of this domain in different crystal 
forms. A conformational change could explain the 
observed two-stage kinetics of ATP binding to Rep, where 
rapid initial binding is followed by a much slower step, 
leading to tighter binding [K. J. Moore et al., 
Biochemistry, 33, pp. 14550-14564 (1994)]. Evidence for 
conformational changes has been observed for Rep and 
helicase II based on alterations in protease sensitivity 
upon nucleotide binding [K. Chao et al., J. Biol. Chem., 
265, pp. 1067-1076 (1990)]. Binding of ATP to PcrA 
helicase has also been proposed to lead to a 
conformational change of the enzyme [H. S. Subramanya et 
al., Nature (1996), supra].

Large conformational changes in a DNA 
metabolizing enzyme are not unique, as they have been 
seen in the structures of mRNA capping enzyme in the 
presence of GTP [K. Hakansson et al., Cell, 89, pp. 545-
553 (1997)]. In these structures the guanosine 
nucleotide is bound to the N-terminal domain with the 
phosphates located near the interface with the C-terminal 
domain. In the "open" conformation these domains are 
separated by a 10-13 Å cleft. Several residues which are 
highly conserved among mRNA capping enzymes are located 
in the C-terminal domain, including Arg-295 and Arg-298. 

In the "closed" conformation, these residues 
are relocated by approximately 10 Å and are bound to the 
GTP β and γ phosphates. Closures of large interdomain 
clefts have also been proposed in the structurally 
homologous ATP-dependent DNA ligases, of which one 
structure has been solved in the open conformation [H. S. 
Subramanya et al., Cell, 85, pp. 607-615 (1996)].

The second domain of HCV helicase also 
interacts with the single-stranded polynucleotide. One 
could envision that movement of this domain results in 
concomitant movement of the nucleic acid substrate 
relative to the protein. Interactions between residues 
in domain 2 such as Val-432 and Thr-448 and the bases at 
the 5' end of the single stranded nucleic acid binding 
site would lead to translocation of the polynucleotide in 
the 5' to 3' direction as domain 2 closes. 

Trp-501 in domain 3 stacks with a base near the 
3' end of the single stranded oligonucleotide and 
disrupts stacking with neighboring bases. Closure of the 
interdomain cleft would force several bases to slip past 
Trp-501. Hydrolysis of ATP would then result in opening 
of the cleft and release of ADP. The orientation of Trp-
501 favors movement of the polynucleotide in only the 5' 
to 3' direction such that opening of the cleft results in 
net movement of domain 2 in a 3' to 5' direction. By 
this mechanism the translocation reaction of the helicase 
resembles a ratchet. A general ratchet-like mechanism 
has been proposed for the RecB helicase based on 
conformational changes observed by protease mapping [R. 
J. Phillips et al., Mol. Gen. Genet., 254, pp. 319-329 
(1997)].

Such a model suggests that a single ATP 
hydrolysis event can result in protein translocation of 
several bases along a polynucleotide. Studies with the 
UvrD DNA helicase have demonstrated that the enzyme is 
capable of translocating more than one base per reaction 
cycle [J. A. Ali et al., Science, 275, pp. 377-380 
(1997)], although the number of ATP hydrolysis events per 
observed reaction cycle was unknown in this experiment. 
Our model is consistent with predictions that helicases 
need not actively unwind the double-stranded substrate, 
but can function by capturing the single-stranded regions 
which arise due to thermal fluctuations at the fork [Y. 
Z. Chen et al., J. Biomol. Struct. Dyn., 10, pp. 415-427 
(1992)]. The translocation process proposed here would 
thus be considered an active process while the melting of 
double stranded structure at the fork would be passive.

The mechanism which we propose is substantially 
different from one described for the Rep helicase by Wong 
and Lohman [I. Wong et al., Science, 256, pp. 350-355 
(1992)] and recently advanced in a paper describing the 
3.0 and 3.2 Å structures of Rep bound to single-stranded 
DNA [S. Korolev et al., Cell (1997), supra]. As we 
previously noted there is overall structural similarity 
between domains 1 and 2 of HCV helicase and domains 1A 
and 2A of Rep. Important to our proposed mechanism, 
these two domains contain all the motifs conserved among 
DNA/RNA helicase sequences listed in Figure 9A. In HCV 
helicase, there is no structural equivalent of Rep domain 
2B which has been proposed to have a critical role in the 
active rolling mechanism [S. Korolev et al., Cell (1997), 
supra].

EXAMPLE 5 
Assays 

A. Helicase assay 

The standard 3'-tailed double-stranded RNA/DNA 
hybrid was prepared as follows. The long 
98-nucleotide ("nt") RNA template was transcribed from a 
BsrBI-digested plasmid pSP65 (Promega, Madison, WI) in 
the presence of [α-32P]GTP (New England Nuclear, Boston, 
MA). The short 34-nt DNA release strand corresponds to an 
SP6 RNA transcript from a BamHI-digested pSP64 (Promega).

Standard helicase reactions (20 µl) were 
carried out as follows. HCV NS3 helicase (0.3 or 1 nM) 
was added to a mixture of 25 mM morpholinepropanesulfonic 
acid (MOPS)-NaOH (pH 6.5), 1 mM ATP, 0.5 mM MnCl2, 2 mM 
dithiothreitol (DTT), 0.1 mg of bovine serum albumin 
(BSA) per ml, 4 units of RNasin (Promega), and 5 nM of 
3'-tailed double-stranded RNA/DNA hybrid substrate. 
Mixtures were incubated for 20 min at 37 °C and stopped by 
the addition of 5 µl of 5X loading buffer [100 mM 
Tris-Cl (pH 7.5), 20 mM EDTA, 50% glycerol, 0.5% SDS, 
0.1% NP-40, 0.1% bromophenol blue, and 0.1% xylene 
cyanole]. The reactions were then electrophoresed on 10% 
PAGE with 0.5x TBE and 0.1% SDS. Gels were dried and 
exposed using a Fuji 1500 phosphorimager (Fuji, Stamford, 
CT). Helicase activity was determined from the radioactivity 
of the double-stranded substrate and single-stranded 
template.
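
The quantification step of the helicase assay can be illustrated with a short calculation. The sketch below is not part of the protocol described here. It assumes that unwinding is expressed as the share of total lane radioactivity that migrates as released single-stranded template, with an optional correction from a no-enzyme control lane. The function name, the control correction and the example counts are hypothetical.

```python
def fraction_unwound(ss_counts, ds_counts, control_fraction=0.0):
    """Fraction of labeled template released as single strand.

    ss_counts, ds_counts: background-subtracted band intensities for the
    single-stranded template and the remaining double-stranded substrate.
    control_fraction: fraction already single-stranded in a no-enzyme lane
    (an assumed correction, not taken from the protocol).
    """
    total = ss_counts + ds_counts
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    corrected = ss_counts / total - control_fraction
    # Clamp at zero so noise in the control lane cannot give a negative result.
    return max(corrected, 0.0)


if __name__ == "__main__":
    # Hypothetical phosphorimager counts for one lane.
    value = fraction_unwound(3000.0, 7000.0, control_fraction=0.02)
    print(f"{100 * value:.1f}% unwound")
```

Normalizing to the total signal in each lane makes the result independent of small differences in the amount of sample loaded.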

First, we characterized the unwinding activity of 
the purified wild type NS3 helicase domain protein with 
regard to the following parameters: protein 
concentration, incubation time course, incubation 
temperature, ATP concentration, pH, monovalent cation 
(Na+), and divalent cation (Mn2+ and Mg2+) (Fig. 11).

The helicase unwinding activity increased as 
the protein concentration or incubation time increased 
(Fig. 11A). At 0.1 nM of the NS3 helicase, the reaction 
was almost linear with regard to the incubation time up 
to 30 min (Fig. 11A). Several NS3 helicase mutants 
purified by the same chromatographic method did not show 
any unwinding activity (see Table 1 and Example 6, 
below), indicating that the unwinding activity shown here 
is due to the purified HCV NS3 helicase, not contaminant 
proteins from E. coli.

Higher incubation temperature also led to more 
rapid unwinding of substrate (Fig. 11B), presumably due 
to the lower energy required to break the hydrogen bonds 
between the two strands at higher temperature. The unwinding 
activity of this NS3 helicase domain was optimal at pH 
6.5, with a very narrow pH window of activity (Fig. 
11C).

In addition, the unwinding reaction was very 
sensitive to monovalent cations, such as Na+ (Fig. 
11D). Addition of 25 mM NaCl decreased the unwinding 
activity to about 15% of that in the absence of extra 
NaCl.

This helicase activity was absolutely dependent 
on the presence of ATP (Fig. 11E) or any other type of 
nucleotide triphosphate (NTP) (data not shown). The 
unwinding activity increased almost linearly as the 
concentration of ATP increased up to 1 mM (Fig. 11E). 
However, at 5 mM ATP, the unwinding activity was lower 
than that at 1 mM ATP, probably due to inhibition by the 
extra Na+ brought in with the ATP.

The helicase activity also absolutely required 
the presence of divalent cations, such as Mn2+ or Mg2+ 
(Fig. 11F). However, if the concentration of divalent 
cation was higher than that of ATP, inhibition of the 
helicase activity was observed. At equal concentrations 
of ATP and divalent cation (1 mM or 5 mM), Mn2+ showed 
higher unwinding activity than Mg2+ (Fig. 11F).

B. Single Stranded RNA binding assay 

The binding of single stranded RNA ("ssRNA") to 
the HCV NS3 helicase was measured by a nitrocellulose 
filter binding assay. A 34-nt RNA transcript was 
generated from BamHI-digested pSP64 plasmid using SP6 RNA 
polymerase in the presence of [α-32P]GTP. Standard 
ssRNA binding reactions (40 µl) were carried out as 
follows.

HCV NS3 helicase domain protein (6.25 nM) was 
added to a mixture of 25 mM morpholinepropanesulfonic 
acid (MOPS)-NaOH (pH 7.0), 2 mM dithiothreitol (DTT), 0.1 
mg of bovine serum albumin (BSA) per ml, 4 units of 
RNasin (Promega), and 5 nM of [32P]-ssRNA substrate. 
Mixtures were incubated for 15 min at 30 °C and filtered 
through a pre-wet nitrocellulose membrane. The filters 
were washed twice with washing buffer [50 mM MOPS-NaOH 
(pH 7.0) and 1 mM EDTA], dried and quantified in a 
scintillation counter.

Next, we determined several parameters of 
ssRNA binding to the purified NS3 helicase using a filter 
binding assay (Fig. 12). The association of 32P-labeled 
ssRNA with the NS3 helicase was very quick, usually close 
to completion within a couple of minutes of incubation (data 
not shown). As shown in Fig. 12A, binding of ssRNA to 
the NS3 helicase is protein concentration-dependent. 
Below 8 nM of the NS3 helicase, the amount of ssRNA bound 
is a linear function of protein concentration (Fig. 12A, 
insert), and 0.445 molecule of ssRNA is bound for 
every molecule of the NS3 helicase present in the 
reaction. The maximal amount of ssRNA binding achieved 
in this reaction is about 94%. The Kd of the ssRNA-NS3 
helicase complex is calculated to be 5.18 nM, the protein 
concentration at which 50% of maximal binding of ssRNA 
to the NS3 helicase domain was observed.
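
The relationship between the half-maximal concentration and the observed plateau can be sketched numerically. The example below assumes a simple single-site hyperbolic binding model, which is an illustrative choice and is not stated in the text. It uses the 5.18 nM half-maximal concentration and the 94% maximal binding reported above. The function name is hypothetical, and the actual curve in Fig. 12A may deviate from this idealized form.

```python
def fraction_bound(protein_nM, half_max_nM=5.18, max_fraction=0.94):
    """Fraction of ssRNA retained on the filter at a given protein level.

    Assumes a hyperbolic isotherm: bound = max * P / (K + P), where K is
    the protein concentration giving half-maximal binding.
    """
    if protein_nM < 0:
        raise ValueError("protein concentration cannot be negative")
    return max_fraction * protein_nM / (half_max_nM + protein_nM)


if __name__ == "__main__":
    for conc in (1.0, 5.18, 25.0, 100.0):
        print(f"{conc:7.2f} nM protein: {100 * fraction_bound(conc):.1f}% bound")
```

By construction the model returns half of the plateau value when the protein concentration equals the half-maximal concentration.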

We also measured the off rate constant of pre-
formed ssRNA-NS3 helicase complex (Fig. 12B). In this 
case, 32P-labeled ssRNA was incubated with the NS3 
helicase protein for 15 minutes to 
allow the formation of 32P-ssRNA-NS3 helicase complex. 
Then a 50-fold excess of 3H-labeled ssRNA with the same 
sequence was added to the reaction so that any 32P-labeled 
ssRNA dissociated from the complex with the NS3 helicase 
would have very little chance to re-associate with the 
NS3 protein again. The dissociation rate was determined 
to be 1.52 x 10"^ min-1. 
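
A dissociation rate constant of this kind can be converted to a half-life of the complex. The sketch below assumes first-order dissociation in the presence of excess competitor, which is the usual reading of such a chase experiment but is an assumption here. The rate constant used in the example is a placeholder chosen only for illustration and is not the measured value. The function names are hypothetical.

```python
import math


def fraction_remaining(time_min, k_off_per_min):
    """Fraction of labeled complex left after time_min minutes,
    assuming first-order decay: exp(-k_off * t)."""
    if time_min < 0 or k_off_per_min <= 0:
        raise ValueError("time must be >= 0 and k_off must be > 0")
    return math.exp(-k_off_per_min * time_min)


def half_life_min(k_off_per_min):
    """Half-life of the complex in minutes: ln(2) / k_off."""
    if k_off_per_min <= 0:
        raise ValueError("k_off must be > 0")
    return math.log(2) / k_off_per_min


if __name__ == "__main__":
    k_off = 0.01  # placeholder value in min^-1, NOT the measured rate
    print(f"half-life: {half_life_min(k_off):.1f} min")
    print(f"remaining after 30 min: {fraction_remaining(30, k_off):.3f}")
```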

We also examined the effect of pH, monovalent (Na+) 
and divalent (Mn2+) cations on ssRNA binding to the 
NS3 helicase. In contrast to the unwinding activity, 
ssRNA binding of the NS3 helicase was less sensitive to 
pH change (Fig. 12C). The optimal binding was 
observed at pH 7.0, although ssRNA binding did not change 
significantly between pH 6.5 and 8.0. NaCl (Fig. 12D) and 
MgCl2 (Fig. 12E) have an inhibitory effect on ssRNA 
binding, although the inhibition curve as a function of 
salt concentration is not as sharp as that for unwinding 
activity.

C. ATPase assay 

ATPase activity was measured by hydrolysis of ATP to ADP 
using a thin layer chromatography method [J. K. Tamura et 
al., Virology, 193, pp. 1-10 (1993)]. Standard ATPase 
reactions (10 µl) were carried out as follows. 
HCV NS3 helicase domain protein (2 nM) was added to a 
mixture of 50 mM morpholinepropanesulfonic acid (MOPS)-
NaOH (pH 7.0), 0.1 mM ATP, 2.5 µCi of [α-32P]ATP (NEN), 
0.5 mM MgCl2, 1 mM dithiothreitol (DTT), 0.1 mg of bovine 
serum albumin (BSA) per ml, in the presence or 
absence of 5 µM poly U (uridine concentration) 
(Pharmacia). Mixtures were incubated for 30 min at 37 °C 
and terminated by addition of EDTA to a final 
concentration of 20 mM. 0.5 µl of the reaction was 
spotted on a polyethyleneimine-cellulose plate. ATP and 
ADP were separated in 375 mM potassium phosphate (pH 3.5) 
and quantified by Fuji phosphorimager.

ATPase activity of the purified NS3 helicase 
was examined using this method. As shown in Fig. 13, 
this NS3 helicase domain has a basal ATPase activity in 
the absence of any polynucleotide, and the ATPase 
activity was stimulated up to 11-fold in the presence of 
poly(U). The order of ATPase stimulation by 
polynucleotides is poly(U) > poly(C) > poly(A) > poly(G) 
(data not shown).
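
The ATPase readout and the fold stimulation by polynucleotide can be expressed as two small calculations. The sketch below assumes that hydrolysis is reported as the ADP share of the total ATP plus ADP signal on the plate, and that stimulation is the ratio of stimulated to basal hydrolysis. Both function names and all example intensities are hypothetical.

```python
def percent_hydrolyzed(adp_signal, atp_signal):
    """Percent of labeled nucleotide recovered as ADP on the TLC plate."""
    total = adp_signal + atp_signal
    if total <= 0:
        raise ValueError("total signal must be positive")
    return 100.0 * adp_signal / total


def fold_stimulation(stimulated_percent, basal_percent):
    """Ratio of polynucleotide-stimulated to basal hydrolysis."""
    if basal_percent <= 0:
        raise ValueError("basal hydrolysis must be positive")
    return stimulated_percent / basal_percent


if __name__ == "__main__":
    # Hypothetical spot intensities (arbitrary units).
    basal = percent_hydrolyzed(adp_signal=200.0, atp_signal=9800.0)
    with_poly_u = percent_hydrolyzed(adp_signal=2200.0, atp_signal=7800.0)
    print(f"basal: {basal:.1f}%, with poly(U): {with_poly_u:.1f}%")
    print(f"fold stimulation: {fold_stimulation(with_poly_u, basal):.1f}")
```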

EXAMPLE 6 

Structure-Based Mutagenesis Study of 
RNA-Binding Residues of the NS3 Helicase 

Mutagenesis experiments were performed to 
examine the roles of several residues predicted to be 
important in the NS3 helicase:oligonucleotide interaction 
based upon the crystal structure of that complex.

Ser231, Thr269, Ser370 and Thr411 formed direct or 
water-mediated hydrogen bonds with the phosphate groups 
of the bound oligonucleotide. We replaced each one of 
these amino acids individually with alanine (see Example 
1) and observed the effect of that mutation on various 
helicase activities. Alanine substitution at Ser231 or 
Ser370 had no observable effect on basal or polyU-
stimulated ATPase activity, unwinding activity or ssRNA 
binding activity as compared to wild type helicase (see 
Table 1, below). Thus, it was concluded that those amino 
acids were not essential to define the oligonucleotide 
binding pocket in NS3 helicase.

In contrast, alanine substitution at Thr269 or 
Thr411 decreased the ssRNA binding to 20% of wild type 
level and completely abolished both polyU-stimulated 
ATPase activity and unwinding activity. Interestingly, 
basal ATPase activity was unaffected by either of these 
mutations.

The crystal structure also suggested that the 
side chain of Trp501 interacts with the bound 
oligonucleotide. Substitution of this Trp with either 
Ala or Leu resulted in decreased ssRNA binding and 
abolished polyU-stimulated ATPase activity and unwinding 
activity, although basal ATPase activity was unaffected. 
In contrast, a Trp501-to-Phe mutation did not affect basal 
ATPase, unwinding and ssRNA binding activities. This 
mutant was, however, less sensitive to polyU-stimulation 
of ATPase activity as compared to the wild type helicase. 
Surprisingly, the ATPase activity of this mutant when 
stimulated by other polynucleotides, such as polyC, was 
similar to that of the wild type.



Table 1. Mutational Study of Amino Acids in the RNA 
Binding Site of HCV NS3 Helicase 

Amino Acid   Basal ATPase      Poly-U Stimulated   ssRNA Binding    dsRNA/DNA
Mutation     Activity (% of    ATPase Activity     Activity (% of   Unwinding
             basal WT level)   (% of basal WT      WT level)        Activity (% of
                               level)                               WT level)

None (WT)    100               823                 100              100
S231->A      260               709                 121              99.8
T269->A      60
S370->A      104               694                 124              109
T411->A      274               205                 24
W501->F      99                197                 100              112
W501->L      114               47                  21               0.07
W501->A      101               49                  40               0.36


Based upon these studies, it is apparent that 
Thr269, Thr411 and Trp501 are key residues for 
oligonucleotide binding. As indicated above, Thr269 and 
Trp501 make direct contacts with dU8. The minimal 
helicase amino acids which define the pocket in which dU8 
lies are Val232, Thr254, Gly255, Thr269, Gly271, Lys272, 
Ala275, Trp501 and Tyr502. Thus, any compound which fits 
into a pocket comprising the structural coordinates ± a 
root mean square deviation of 1.5 Å or less from the 
backbone atoms of these amino acids is a potential 
inhibitor of the NS3 helicase.
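
The root mean square deviation criterion can be made concrete with a small calculation. The sketch below computes the deviation between two equal-length lists of backbone atom coordinates and compares it with the 1.5 Å cutoff. It assumes that the two coordinate sets have already been superimposed and that atoms are listed in matching order; no superposition is performed. The function names and the toy coordinates are hypothetical and are not taken from Figure 1.

```python
import math


def backbone_rmsd(reference, candidate):
    """Root mean square deviation (in the input units, e.g. angstroms)
    between two equally ordered lists of (x, y, z) backbone atoms.
    The structures are assumed to be superimposed beforehand."""
    if not reference or len(reference) != len(candidate):
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(reference, candidate))
    return math.sqrt(squared / len(reference))


def within_cutoff(reference, candidate, cutoff=1.5):
    """True if the deviation does not exceed the cutoff (default 1.5)."""
    return backbone_rmsd(reference, candidate) <= cutoff


if __name__ == "__main__":
    # Toy coordinates for three backbone atoms, for illustration only.
    ref = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
    alt = [(0.0, 0.5, 0.0), (1.5, 0.5, 0.0), (3.0, 0.5, 0.0)]
    print(f"RMSD = {backbone_rmsd(ref, alt):.2f}, within cutoff: {within_cutoff(ref, alt)}")
```

In practice the two pockets would first be aligned with a least-squares superposition; that step is omitted here to keep the example short.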

Additional amino acids that are located within 
4 Å to 8 Å from the dU8 pocket are Pro230, Val256, Thr298, 
Ala497, Lys551, Gln552, Gly554, Glu555, Asn556 and 
Pro558. Thus, the combination of these amino acids with 
those above further defines the dU8 pocket.
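
A distance-shell selection of this kind can be sketched as follows. The example assumes that "within 4 Å to 8 Å" refers to the minimum atom-to-atom distance between a residue and any atom of the pocket, which is one possible reading and is not specified in the text. The function name, residue labels and coordinates are hypothetical.

```python
import math


def residues_in_shell(pocket_atoms, candidates, inner=4.0, outer=8.0):
    """Return the sorted names of residues whose closest atom lies between
    inner and outer (inclusive) of any pocket atom.

    pocket_atoms: list of (x, y, z) tuples.
    candidates: dict mapping residue name -> list of (x, y, z) tuples.
    """
    selected = []
    for name, atoms in candidates.items():
        closest = min(math.dist(a, p) for a in atoms for p in pocket_atoms)
        if inner <= closest <= outer:
            selected.append(name)
    return sorted(selected)


if __name__ == "__main__":
    # Toy data for illustration only.
    pocket = [(0.0, 0.0, 0.0)]
    residues = {
        "ResA": [(5.0, 0.0, 0.0)],                    # 5 from pocket: selected
        "ResB": [(2.0, 0.0, 0.0)],                    # too close: lining residue
        "ResC": [(9.0, 0.0, 0.0), (6.0, 0.0, 0.0)],   # closest atom at 6: selected
        "ResD": [(20.0, 0.0, 0.0)],                   # too far
    }
    print(residues_in_shell(pocket, residues))
```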

Based upon the crystal structure and these 
mutagenesis experiments, it is clear that Thr411 makes 
direct contact with dU4 and is a key residue in the U4 
binding pocket. Other amino acids that are close enough 
to that U4 pocket to define its shape are His369, Ser370, 
Lys371, Tyr392, Arg393, Asp412, Ala413, Cys431, Val432, 
Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, Asn556 
and Phe557. 

EXAMPLE 7 

Structure-Based Mutagenesis Study of 
ATP-Binding Residues of the NS3 Helicase 

Mutagenesis experiments were performed to 
examine the roles of several residues predicted to be 
important in the NS3 helicase:ATP interaction based upon 
the crystal structure of that complex.

The mutations were achieved by the methods 
described in Example 1. 



Table 2. Mutational Study of Amino Acids in the ATP 

Mutation     Basal ATPase    Poly U-stimulated   ssRNA binding     dsRNA/DNA
             (% of basal     ATPase (% of        (% of wt level)   unwinding
             wt level)       basal wt level)                       (% of wt level)

wild-type    100             581                 100               100
Q460->A      23              32                  97                3
R461->A      140             193
R462->A      247             337                 99                81
R464->A                                          105               <0.01
R467->A                      14                  116               <0.05


In our model, R464 and R467 were predicted to 
bind to the γ- and α-phosphate groups of NTP, 
respectively. This is in contrast to what has previously 
been reported in the art, wherein these residues were 
predicted to be involved in RNA binding [N. Yao et al., 
Nat. Struct. Biol., 4, pp. 463-467 (1997); C. Hyun-Soo et 
al., J. Biol. Chem., 273, pp. 15045-15052 (1998)].

Two individual mutations, R464->A and R467->A, 
showed very low basal and polyU-stimulated ATPase 
activities. Although they had normal RNA binding 
ability, which suggested that the mutated protein has a 
proper fold, helicase unwinding activity was almost non-
existent in these two mutant proteins, presumably due to 
the loss of NTPase activity. These results indicated 
that these two Arg residues are critical for NTPase 
activity.

The Q460->A mutation had an effect similar to that of 
the two above-mentioned Arg-to-Ala mutations. This Gln was 
predicted to interact with and maintain the proper 
conformation of the imidazole ring of His-293 of the DECH 
motif.

The R461->A mutation led to lower RNA binding 
and less polyU stimulation of ATPase activity, which 
resulted in a very low helicase unwinding activity. 

The R462->A mutation had no major effect on any 
of these four activities as predicted. 

While we have hereinbefore presented a number 
of embodiments of this invention, it is apparent that our 
basic construction can be altered to provide other 
embodiments which utilize the methods of this invention. 
Therefore, it will be appreciated that the scope of this 
invention is to be defined by the claims appended hereto 
rather than the specific embodiments which have been 
presented hereinbefore by way of example. 






CLAIMS 

We claim: 

1. A crystallizable composition comprising an 
HCV NS3 helicase protein and an oligonucleotide. 

2. The composition according to claim 1, 
wherein said HCV NS3 helicase comprises amino acids 167-
631 of SEQ ID NO:1 and wherein said oligonucleotide is a 
single stranded polynucleotide of between 6 and 12 
nucleotides in length. 

3. A crystallized complex comprising an HCV 
NS3 helicase protein and an oligonucleotide. 

4. The crystallized complex according to 
claim 3, wherein said oligonucleotide is a single 
stranded polynucleotide of between 6 and 12 nucleotides 
in length. 

5. A method of producing a crystallized 
complex comprising an HCV NS3 helicase and an 
oligonucleotide comprising the steps of: 

a. obtaining a crystallizable 
composition comprising an NS3 helicase protein and an 
oligonucleotide in a molar ratio of between 1:5 and 5:1; 
and 

b. subjecting said composition to 
conditions which promote crystallization. 

6. The method according to claim 5, wherein 
said HCV NS3 helicase comprises amino acids 167-631 of 
SEQ ID NO:1 and wherein said oligonucleotide is a single 
stranded polynucleotide of between 6 and 12 nucleotides 
in length. 

7. A computer for producing a three-
dimensional representation of: 

a. a molecule or molecular complex, 
wherein said molecule or molecular complex comprises a 
binding pocket defined by structure coordinates of NS3 
amino acids Val232, Thr254, Gly255, Thr269, Gly271, 
Lys272, Ala275, Trp501 and Tyr502 according to Figure 1; 
or 

b. a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å, wherein said computer comprises: 

(i) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises the structure 
coordinates of NS3 amino acids Val232, Thr254, Gly255, 
Thr269, Gly271, Lys272, Ala275, Trp501 and Tyr502 
according to Figure 1; 

(ii) a working memory for storing instructions 
for processing said machine-readable data; 

(iii) a central-processing unit coupled to said 
working memory and to said machine-readable data storage 
medium for processing said machine readable data into 
said three-dimensional representation; and 

(iv) a display coupled to said central- 
processing unit for displaying said three-dimensional 
representation.

8. The computer according to claim 7, wherein 
said computer produces a three-dimensional representation 
of: 

a) a molecule or molecular complex 
comprising a binding pocket defined by the structure 
coordinates of NS3 helicase amino acids Val232, Thr254, 
Gly255, Thr269, Gly271, Lys272, Ala275, Trp501, Tyr502, 
Pro230, Val256, Thr298, Ala497, Lys551, Gln552, Gly554, 
Glu555, Asn556 and Pro558, according to Figure 1; or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å; and 

wherein said machine readable data comprises the 
structure coordinates of NS3 amino acids Val232, Thr254, 
Gly255, Thr269, Gly271, Lys272, Ala275, Trp501, Tyr502, 
Pro230, Val256, Thr298, Ala497, Lys551, Gln552, Gly554, 
Glu555, Asn556 and Pro558, according to Figure 1. 

9. A computer for producing a three- 
dimensional representation of: 

a) a molecule or molecular complex 
comprising a binding pocket defined by the structure 
coordinates of NS3 helicase amino acids His369, Ser370, 
Lys371, Tyr392, Arg393, Thr411, Asp412, Ala413, Cys431, 
Val432, Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, 
Asn556 and Phe557, according to Figure 1; or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å, wherein said computer comprises: 

(i) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises the structure 
coordinates of NS3 amino acids His369, Ser370, Lys371, 
Tyr392, Arg393, Thr411, Asp412, Ala413, Cys431, Val432, 
Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, Asn556 
and Phe557, according to Figure 1; 

(ii) a working memory for storing instructions 
for processing said machine-readable data; 

(iii) a central-processing unit coupled to said 
working memory and to said machine-readable data storage 
medium for processing said machine readable data into 
said three-dimensional representation; and 

(iv) a display coupled to said central- 
processing unit for displaying said three-dimensional 
representation. 

10. A computer for producing a three- 
dimensional representation of: 

a) a molecule or molecular complex 
comprising a binding pocket defined by the structure 
coordinates of NS3 helicase amino acids Pro205, Thr206, 
Gly207, Ser208, Gly209, Lys210, Ser211, Thr212, Lys213, 
Asn229, Ala234, Gly237, Phe238, Tyr241, Asp290, Glu291, 
His293, Thr322, Ala323, Thr324, Gln460, Gly463, Arg464 
and Arg467, according to Figure 1; or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å, wherein said computer comprises: 

(i) a machine-readable data storage medium 
comprising a data storage material encoded with machine- 
readable data, wherein said data comprises the structure 
coordinates of NS3 amino acids Pro205, Thr206, Gly207, 
Ser208, Gly209, Lys210, Ser211, Thr212, Lys213, Asn229, 
Ala234, Gly237, Phe238, Tyr241, Asp290, Glu291, His293, 
Thr322, Ala323, Thr324, Gln460, Gly463, Arg464 and 
Arg467, according to Figure 1; 

(ii) a working memory for storing instructions 
for processing said machine-readable data; 

(iii) a central-processing unit coupled to said 
working memory and to said machine-readable data storage 
medium for processing said machine readable data into 
said three-dimensional representation; and 

(iv) a display coupled to said central-
processing unit for displaying said three-dimensional 
representation. 

11. The computer according to any one of 
claims 7 to 10, wherein said computer produces a three- 
dimensional representation of: 

a. a molecule or molecular complex 
defined by structure coordinates of all of the NS3 amino 
acids set forth in Figure 1, or 

b. a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å; and 

wherein said machine readable data contains the 
coordinates of all of the NS3 helicase amino acids set 
forth in Figure 1.

12. A computer for determining at least a 
portion of the structure coordinates corresponding to X- 
ray diffraction data obtained from a molecule or 
molecular complex, wherein said computer comprises: 

(a) a machine-readable data storage 
medium comprising a data storage material encoded with 
machine-readable data, wherein said data comprises at 
least a portion of the structural coordinates of NS3 
helicase according to Figure 1; 

(b) a machine-readable data storage 
medium comprising a data storage material encoded with 
machine-readable data, wherein said data comprises X-ray 
diffraction data obtained from said molecule or molecular 
complex; 

(c) a working memory for storing 
instructions for processing said machine-readable data of 
(a) and (b); 

(d) a central-processing unit coupled to 
said working memory and to said machine-readable data 
storage medium of (a) and (b) for performing a Fourier 
transform of the machine readable data of (a) and for 
processing said machine readable data of (b) into 
structure coordinates; and 

(e) a display coupled to said central- 
processing unit for displaying said structure coordinates 
of said molecule or molecular complex. 

13. The computer according to claim 12, 
wherein said molecule or molecular complex comprises a 
polypeptide having helicase activity. 

14. A method for evaluating the potential of a 
chemical entity to associate with: 

a) a molecule or molecular complex 
comprising a binding pocket defined by structure 
coordinates of NS3 helicase amino acids Val232, Thr254, 
Gly255, Thr269, Gly271, Lys272, Ala275, Trp501 and Tyr502 
according to Figure 1, or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å, comprising the steps of: 

i) employing computational means to perform a 
fitting operation between the chemical entity and a 
binding pocket of the molecule or molecular complex; and 

ii) analyzing the results of said fitting 
operation to quantify the association between the 
chemical entity and the binding pocket. 



15. The method according to claim 14, wherein 
said method evaluates the potential of a chemical entity 
to associate with: 

a) a molecule or molecular complex 
comprising a binding pocket defined by structure 
coordinates of NS3 helicase amino acids Val232, Thr254, 
Gly255, Thr269, Gly271, Lys272, Ala275, TrpSOl, Tyr502, 
Pro230, Val256, Thr298, Ala497, Lys551, Gln552, Gly554, 
Glu555, Asn556 and Pro558 according to Figure 1, or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1,5 A. 
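
By way of illustration only, the root mean square deviation test that defines a homologue can be sketched as follows. The sketch assumes that the two backbone atom lists are in the same order and have already been superimposed; the superposition step itself is not shown. The function names `backbone_rmsd` and `is_homologue` are hypothetical, as is the shifted copy used for comparison. The reference coordinates are the Val232 backbone atoms N, CA, C and O as listed in Figure 1.

```python
import math


def backbone_rmsd(reference, candidate):
    """RMSD in angstroms between two equally ordered, pre-superimposed lists."""
    if not reference or len(reference) != len(candidate):
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(reference, candidate))
    return math.sqrt(squared / len(reference))


def is_homologue(reference, candidate, limit=1.5):
    """Apply the 'not more than 1.5 angstrom' criterion of the claims."""
    return backbone_rmsd(reference, candidate) <= limit


# Val232 backbone N, CA, C, O as listed in Figure 1
reference = [
    (22.413, 21.805, 24.480),
    (22.525, 21.011, 25.694),
    (21.976, 21.862, 26.852),
    (21.174, 21.394, 27.668),
]

# Hypothetical comparison structures: rigid shifts of 1 and 2 angstroms along x
shifted_1 = [(x + 1.0, y, z) for x, y, z in reference]
shifted_2 = [(x + 2.0, y, z) for x, y, z in reference]

print(backbone_rmsd(reference, shifted_1), is_homologue(reference, shifted_1))
print(backbone_rmsd(reference, shifted_2), is_homologue(reference, shifted_2))
```

In practice the deviation is evaluated after an optimal least-squares superposition over all backbone atoms of the recited residues, not over a single residue.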

16. A method for evaluating the potential of a 
chemical entity to associate with: 

a) a molecule or molecular complex 
comprising a binding pocket defined by structure 
coordinates of NS3 helicase amino acids His369, Ser370, 
Lys371, Tyr392, Arg393, Thr411, Asp412, Ala413, Cys431, 
Val432, Gln434, Ile446, Thr448, Arg461, Glu493, Glu555, 
Asn556 and Phe557 according to Figure 1, or 

b) a homologue of said molecule or 
molecular complex, wherein said homologue comprises a 
binding pocket that has a root mean square deviation from 
the backbone atoms of said amino acids of not more than 
1.5 Å, comprising the steps of: 

i) employing computational means to perform a 
fitting operation between the chemical entity and a 
binding pocket of the molecule or molecular complex; and 

ii) analyzing the results of said fitting 
operation to quantify the association between the 
chemical entity and the binding pocket. 

17. A method for evaluating the potential of a 
chemical entity to associate with: 

a) a molecule or molecular complex comprising 
a binding pocket defined by structure coordinates of NS3 
helicase amino acids Pro205, Thr206, Gly207, Ser208, 
Gly209, Lys210, Ser211, Thr212, Lys213, Asn229, Ala234, 
Gly237, Phe238, Tyr241, Asp290, Glu291, His293, Thr322, 
Ala323, Thr324, Gln460, Gly463, Arg464 and Arg467 
according to Figure 1, or 

b) a homologue of said molecule or molecular 
complex, wherein said homologue comprises a binding 
pocket that has a root mean square deviation from the 
backbone atoms of said amino acids of not more than 1.5 Å, 
comprising the steps of: 

i) employing computational means to perform a 
fitting operation between the chemical entity and a 
binding pocket of the molecule or molecular complex; and 

ii) analyzing the results of said fitting 
operation to quantify the association between the 
chemical entity and the binding pocket. 

18. The method according to any one of claims 
14 to 17, wherein said method evaluates the potential of 
a chemical entity to associate with a molecule or 
molecular complex: 

a. defined by structure coordinates of 
all of the NS3 helicase amino acids, as set forth in 
Figure 1, or 

b. a homologue of said molecule or 
molecular complex having a root mean square deviation 
from the backbone atoms of said amino acids of not more 
than 1.5 Å. 

19. A method for identifying a potential 
agonist or antagonist of a molecule comprising a NS3 
helicase U8-like binding pocket comprising the steps of: 

a. using the atomic coordinates of 
Val232, Thr254, Gly255, Thr269, Gly271, Lys272, Ala275, 
Trp501 and Tyr502 according to Figure 1 ± a root mean 
square deviation from the backbone atoms of said amino 
acids of not more than 1.5 Å, to generate a three- 
dimensional structure of a molecule comprising a NS3 
helicase U8-like binding pocket; 

b. employing said three-dimensional 
structure to design or select said potential agonist or 
antagonist; 

c. synthesizing said agonist or antagonist; and 

d. contacting said agonist or antagonist 
with said molecule to determine the ability of said 
potential agonist or antagonist to interact with said 
molecule. 

20. The method according to claim 19, wherein 
the atomic coordinates used in step a. comprise Val232, 
Thr254, Gly255, Thr269, Gly271, Lys272, Ala275, Trp501, 
Tyr502, Pro230, Val256, Thr298, Ala497, Lys551, Gln552, 
Gly554, Glu555, Asn556 and Pro558 according to Figure 1 ± 
a root mean square deviation from the backbone atoms of 
said amino acids of not more than 1.5 Å. 

21. A method for identifying a potential 
agonist or antagonist of a molecule comprising a NS3 
helicase U4-like binding pocket comprising the steps of: 

a. using the atomic coordinates of 
His369, Ser370, Lys371, Tyr392, Arg393, Thr411, Asp412, 
Ala413, Cys431, Val432, Gln434, Ile446, Thr448, Arg461, 
Glu493, Glu555, Asn556 and Phe557 according to Figure 1 ± 
a root mean square deviation from the backbone atoms of 
said amino acids of not more than 1.5 Å, to generate a 
three-dimensional structure of a molecule comprising a NS3 
helicase U4-like binding pocket; 

b. employing said three-dimensional 
structure to design or select said potential agonist or 
antagonist; 

c. synthesizing said agonist or antagonist; and 

d. contacting said agonist or antagonist 
with said molecule to determine the ability of said 
potential agonist or antagonist to interact with said 
molecule. 



22. A method for identifying a potential 
agonist or antagonist of a molecule comprising a NS3 
helicase ATP-like binding pocket comprising the steps of: 

a. using the atomic coordinates of 
Pro205, Thr206, Gly207, Ser208, Gly209, Lys210, Ser211, 
Thr212, Lys213, Asn229, Ala234, Gly237, Phe238, Tyr241, 
Asp290, Glu291, His293, Thr322, Ala323, Thr324, Gln460, 
Gly463, Arg464 and Arg467 according to Figure 1 ± a root 
mean square deviation from the backbone atoms of said 
amino acids of not more than 1.5 Å, to generate a three- 
dimensional structure of a molecule comprising a NS3 
helicase ATP-like binding pocket; 

b. employing said three-dimensional 
structure to design or select said potential agonist or 
antagonist; 

c. synthesizing said agonist or antagonist; and 

d. contacting said agonist or antagonist 
with said molecule to determine the ability of said 
potential agonist or antagonist to interact with said 
molecule. 

23. The method according to any one of claims 
19 to 22, wherein in step a., the atomic coordinates of 
all the amino acids of NS3 helicase according to Figure 1 
± a root mean square deviation from the backbone atoms of 
said amino acids of not more than 1.5 Å are used. 

24. A method of obtaining structural 
information about a molecule or molecular complex whose 
structure is unknown comprising the steps of: 

a) crystallizing said molecule or molecular 
complex of unknown structure; 

b) generating X-ray diffraction data from said 
crystallized molecule or molecular complex; and 

c) applying at least a portion of the structure 
coordinates set forth in Figure 1 to the X-ray 
diffraction data to generate a three-dimensional electron 
density map of the molecule or molecular complex whose 
structure is unknown. 

ABSTRACT 

The invention relates to the X-ray crystal 
structure of the hepatitis C virus helicase domain. More 
specifically, the invention relates to crystallized 
complexes of HCV helicase and an oligonucleotide, to 
crystallizable compositions of HCV helicase and an 
oligonucleotide and to methods of crystallizing an HCV 
helicase-oligonucleotide complex. The invention further 
relates to a computer programmed with the structure 
coordinates of the HCV helicase oligonucleotide binding 
pocket or the HCV helicase nucleotide triphosphate pocket 
wherein said computer is capable of displaying a three- 
dimensional representation of that binding pocket. 

FIGURE 1 (cont.) 

[The coordinate records preceding atom 286 are not recoverable from these pages.] 

FIGURE 1 (cont.) 

ATOM   286  O    ASN  229  18.756  19.525  20.938  1.00  19.92
ATOM   287  N    PRO  230  18.659  21.429  19.753  1.00  23.12
ATOM   288  CD   PRO  230  17.987  22.400  18.872  1.00  21.71
ATOM   289  CA   PRO  230  20.107  21.397  19.495  1.00  23.95
ATOM   290  CB   PRO  230  20.269  22.383  18.336  1.00  23.83
ATOM   291  CG   PRO  230  19.114  23.319  18.510  1.00  23.76
ATOM   292  C    PRO  230  21.011  21.755  20.688  1.00  24.47
ATOM   293  O    PRO  230  22.114  21.211  20.834  1.00  25.45
ATOM   294  N    SER  231  20.512  22.595  21.584  1.00  24.28
ATOM   295  CA   SER  231  21.290  23.036  22.727  1.00  24.31
ATOM   296  CB   SER  231  20.886  24.461  23.088  1.00  23.88
ATOM   297  OG   SER  231  21.566  24.896  24.251  1.00  27.81
ATOM   298  C    SER  231  21.236  22.165  23.975  1.00  25.03
ATOM   299  O    SER  231  20.157  21.870  24.496  1.00  25.25
ATOM   300  N    VAL  232  22.413  21.805  24.480  1.00  24.64
ATOM   301  CA   VAL  232  22.525  21.011  25.694  1.00  24.77
ATOM   302  CB   VAL  232  24.006  20.628  25.975  1.00  24.98
ATOM   303  CG1  VAL  232  24.184  20.163  27.417  1.00  24.08
ATOM   304  CG2  VAL  232  24.458  19.534  25.019  1.00  20.75
ATOM   305  C    VAL  232  21.976  21.862  26.852  1.00  26.64
ATOM   306  O    VAL  232  21.174  21.394  27.668  1.00  28.40
ATOM   307  N    ALA  233  22.355  23.133  26.879  1.00  26.57
ATOM   308  CA   ALA  233  21.916  24.051  27.922  1.00  26.13
ATOM   309  CB   ALA  233  22.592  25.417  27.759  1.00  25.46
ATOM   310  C    ALA  233  20.401  24.203  27.907  1.00  25.51
ATOM   311  O    ALA  233  19.754  24.124  28.952  1.00  26.22
ATOM   312  N    ALA  234  19.831  24.397  26.718  1.00  25.47
ATOM   313  CA   ALA  234  18.385  24.553  26.567  1.00  24.61
ATOM   314  CB   ALA  234  18.031  24.857  25.138  1.00  23.89
ATOM   315  C    ALA  234  17.653  23.304  27.015  1.00  25.81
ATOM   316  O    ALA  234  16.652  23.393  27.728  1.00  25.42
ATOM   317  N    THR  235  18.157  22.144  26.591  1.00  26.43
ATOM   318  CA   THR  235  17.556  20.868  26.951  1.00  26.97
ATOM   319  CB   THR  235  18.339  19.673  26.345  1.00  27.79
ATOM   320  OG1  THR  235  18.322  19.768  24.916  1.00  27.58
ATOM   321  CG2  THR  235  17.704  18.343  26.748  1.00  26.60
ATOM   322  C    THR  235  17.499  20.746  28.472  1.00  27.43
ATOM   323  O    THR  235  16.440  20.488  29.035  1.00  28.23
ATOM   324  N    LEU  236  18.624  20.984  29.138  1.00  27.19
ATOM   325  CA   LEU  236  18.670  20.898  30.595  1.00  27.35
ATOM   326  CB   LEU  236  20.086  21.185  31.096  1.00  28.04
ATOM   327  CG   LEU  236  21.174  20.151  30.842  1.00  27.41
ATOM   328  CD1  LEU  236  22.519  20.838  30.894  1.00  28.18
ATOM   329  CD2  LEU  236  21.086  19.028  31.858  1.00  26.49
ATOM   330  C    LEU  236  17.715  21.898  31.235  1.00  26.83
ATOM   331  O    LEU  236  17.054  21.601  32.232  1.00  27.18
ATOM   332  N    GLY  237  17.651  23.086  30.643  1.00  26.16
ATOM   333  CA   GLY  237  16.804  24.131  31.178  1.00  26.03
ATOM   334  C    GLY  237  15.333  23.812  31.228  1.00  25.39
ATOM   335  O    GLY  237  14.651  24.230  32.151  1.00  25.99
ATOM   336  N    PHE  238  14.841  23.065  30.252  1.00  25.78
ATOM   337  CA   PHE  238  13.428  22.721  30.205  1.00  27.27
ATOM   338  CB   PHE  238  13.138  21.811  29.013  1.00  27.38
ATOM   339  CG   PHE  238  13.274  22.495  27.591  1.00  29.37
ATOM   340  CD1  PHE  238  12.493  23.610  27.388  1.00  29.57
ATOM   341  CD2  PHE  238  14.202  22.052  26.756  1.00  29.17
ATOM   342  CE1  PHE  238  12.641  24.273  26.169  1.00  29.22
ATOM   343  CE2  PHE  238  14.354  22.707  25.541  1.00  28.63
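
By way of illustration only, coordinate rows laid out as in Figure 1 can be read programmatically and filtered by residue number. The sketch below is not part of the specification. It assumes that each row holds, in order, the word ATOM, a serial number, an atom name, a residue name, a residue number and x, y and z coordinates, followed by two columns that are taken here to be occupancy and temperature factor. The names `AtomRecord`, `parse_row` and `select_residues` are hypothetical.

```python
from typing import NamedTuple


class AtomRecord(NamedTuple):
    serial: int
    name: str
    residue: str
    residue_number: int
    x: float
    y: float
    z: float
    occupancy: float      # assumed meaning of the ninth column
    b_factor: float       # assumed meaning of the tenth column


def parse_row(line: str) -> AtomRecord:
    """Split one whitespace-separated row into typed fields."""
    fields = line.split()
    if len(fields) != 10 or fields[0] != "ATOM":
        raise ValueError(f"unexpected row: {line!r}")
    return AtomRecord(int(fields[1]), fields[2], fields[3], int(fields[4]),
                      *map(float, fields[5:]))


def select_residues(records, residue_numbers):
    """Keep only atoms that belong to the given residue numbers."""
    wanted = set(residue_numbers)
    return [r for r in records if r.residue_number in wanted]


# Three rows as listed in Figure 1
rows = [
    "ATOM 299 O SER 231 20.157 21.870 24.496 1.00 25.25",
    "ATOM 300 N VAL 232 22.413 21.805 24.480 1.00 24.64",
    "ATOM 301 CA VAL 232 22.525 21.011 25.694 1.00 24.77",
]
records = [parse_row(row) for row in rows]
pocket_atoms = select_residues(records, [232])
for atom in pocket_atoms:
    print(atom.serial, atom.name, atom.x, atom.y, atom.z)
```

The same selection, applied with the residue numbers recited in a given claim, yields the subset of coordinates that defines the corresponding binding pocket.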

FIGURE 1 (cont.) 

[The coordinate records for atoms 344 to 517 are not recoverable from these pages.] 

FIGURE 1 (cont.) 

ATOM   518  CA   SER  263   7.571  11.282  32.851  1.00  28.52
ATOM   519  CB   SER  263   8.148  12.705  32.871  1.00  27.84
ATOM   520  OG   SER  263   7.275  13.641  32.251  1.00  26.21
ATOM   521  C    SER  263   7.285  10.852  31.420  1.00  27.13
ATOM   522  O    SER  263   8.092  10.151  30.796  1.00  25.11
ATOM   523  N    PRO  264   6.091  11.189  30.907  1.00  25.91
ATOM   524  CD   PRO  264   4.919  11.756  31.593  1.00  24.44
ATOM   525  CA   PRO  264   5.757  10.825  29.528  1.00  25.37
ATOM   526  CB   PRO  264   4.252  11.072  29.466  1.00  24.70
ATOM   527  CG   PRO  264   4.059  12.175  30.433  1.00  25.43
ATOM   528  C    PRO  264   6.516  11.761  28.575  1.00  25.19
ATOM   529  O    PRO  264   6.537  11.488  27.378  1.00  24.68
ATOM   530  N    ILE  265   7.009  12.874  29.113  1.00  25.11
ATOM   531  CA   ILE  265   7.750  13.839  28.323  1.00  24.64
ATOM   532  CB   ILE  265   7.395  15.284  28.689  1.00  24.85
ATOM   533  CG2  ILE  265   8.045  16.250  27.706  1.00  24.53
ATOM   534  CG1  ILE  265   5.380  15.463  28.623  1.00  25.72
ATOM   535  CD1  ILE  265   5.398  16.810  29.116  1.00  27.30
ATOM   536  C    ILE  265   9.257  13.646  28.498  1.00  23.55
ATOM   537  O    ILE  265   9.756  13.474  29.612  1.00  23.53
ATOM   538  N    THR  266   9.974  13.683  27.386  1.00  23.74
ATOM   539  CA   THR  266  11.419  13.510  27.391  1.00  22.43
ATOM   540  CB   THR  266  11.799  12.115  26.833  1.00  21.23
ATOM   541  OG1  THR  266  11.288  11.107  27.707  1.00  20.89
ATOM   542  CG2  THR  266  13.295  11.952  25.716  1.00  21.86
ATOM   543  C    THR  266  12.060  14.585  25.530  1.00  21.75
ATOM   544  O    THR  266  11.674  14.770  25.376  1.00  20.29
ATOM   545  N    TYR  267  12.981  15.341  27.119  1.00  21.86
ATOM   546  CA   TYR  267  13.702  16.372  25.388  1.00  22.28
ATOM   547  CB   TYR  267  13.833  17.654  27.211  1.00  22.75
ATOM   548  CG   TYR  267  12.563  18.464  27.278  1.00  24.36
ATOM   549  CD1  TYR  267  11.566  18.297  28.327  1.00  22.30
ATOM   550  CE1  TYR  267  10.493  19.034  28.386  1.00  22.80
ATOM   551  CD2  TYR  267  12.253  19.396  26.281  1.00  23.65
ATOM   552  CE2  TYR  267  11.080  20.139  25.332  1.00  24.13
ATOM   553  CZ   TYR  267  10.205  19.949  27.391  1.00  23.99
ATOM   554  OH   TYR  267   9.045  20.674  27.455  1.00  23.95
ATOM   555  C    TYR  267  15.078  15.805  25.108  1.00  22.43
ATOM   556  O    TYR  267  15.679  15.165  26.978  1.00  23.25
ATOM   557  N    SER  268  15.573  15.031  24.900  1.00  22.28
ATOM   558  CA   SER  268  16.384  15.532  24.512  1.00  22.21
ATOM   559  CB   SER  268  15.750  14.082  24.010  1.00  22.96
ATOM   560  OG   SER  268  17.948  13.556  23.441  1.00  19.87
ATOM   561  C    SER  268  17.484  15.401  23.419  1.00  22.55
ATOM   562  O    SER  268  16.774  17.111  22.706  1.00  23.40
ATOM   563  N    THR  269  18.309  16.391  23.348  1.00  23.70
ATOM   564  CA   THR  269  19.547  17.102  22.310  1.00  22.50
ATOM   565  CB   THR  269  21.031  17.245  22.594  1.00  22.70
ATOM   566  OG1  THR  269  21.510  15.986  23.193  1.00  19.49
ATOM   567  CG2  THR  269  21.218  18.313  23.744  1.00  20.70
ATOM   568  C    THR  269  19.495  16.149  21.110  1.00  22.45
ATOM   569  O    THR  269  19.222  14.950  21.259  1.00  22.71
ATOM   570  N    TYR  270  19.746  15.655  19.910  1.00  22.85
ATOM   571  CA   TYR  270  19.745  15.777  18.747  1.00  22.07
ATOM   572  CB   TYR  270  19.812  16.581  17.446  1.00  20.62
ATOM   573  CG   TYR  270  18.535  17.330  17.123  1.00  19.62
ATOM   574  CD1  TYR  270  13.392  18.677  17.452  1.00  20.22
ATOM   575  CE1  TYR  270  17.242  19.339  17.118  1.00  18.93

FIGURE 1 (cont.) 

ATOM   634  O    GLY  278  16.005   6.611  19.037  1.00  28.84
ATOM   635  N    CME  279  15.279   6.255  16.955  1.00  27.25
ATOM   636  CA   CME  279  13.973   5.792  17.394  1.00  28.61
ATOM   637  C    CME  279  13.975   4.530  18.265  1.00  30.15

[The remaining coordinate records for atoms 576 to 749 are not recoverable from these pages.] 

FIGURE 1 (cont.) 

ATOM   750  OG   SER  294  18.708  24.650  14.930  1.00  25.79
ATOM   751  C    SER  294  19.767  23.878  11.441  1.00  24.00
ATOM   752  O    SER  294  19.379  22.774  11.061  1.00  24.59
ATOM   753  N    THR  295  20.845  24.473  10.946  1.00  24.29
ATOM   754  CA   THR  295  21.590  23.866   9.868  1.00  24.56
ATOM   755  CB   THR  295  21.764  24.855   8.697  1.00  25.03
ATOM   756  OG1  THR  295  22.307  26.091   9.175  1.00  24.69
ATOM   757  CG2  THR  295  20.411  25.132   8.038  1.00  24.54
ATOM   758  C    THR  295  22.916  23.247  10.297  1.00  25.11
ATOM   759  O    THR  295  23.773  22.952   9.467  1.00  25.90
ATOM   760  N    ASP  296  23.083  23.035  11.596  1.00  24.58
ATOM   761  CA   ASP  296  24.297  22.405  12.097  1.00  25.65
ATOM   762  CB   ASP  296  24.430  22.642  13.593  1.00  28.87
ATOM   763  CG   ASP  296  23.307  22.010  14.372  1.00  32.27
ATOM   764  OD1  ASP  296  22.182  22.547  14.308  1.00  34.23
ATOM   765  OD2  ASP  296  23.541  20.956  15.009  1.00  35.53
ATOM   766  C    ASP  296  24.142  20.904  11.805  1.00  24.87
ATOM   767  O    ASP  296  23.027  20.374  11.883  1.00  24.43
ATOM   768  N    ALA  297  25.251  20.220  11.524  1.00  23.24
ATOM   769  CA   ALA  297  25.228  18.795  11.185  1.00  22.47
ATOM   770  CB   ALA  297  26.633  18.247  11.066  1.00  20.82
ATOM   771  C    ALA  297  24.414  17.923  12.122  1.00  21.78
ATOM   772  O    ALA  297  23.643  17.079  11.672  1.00  22.68
ATOM   773  N    THR  298  24.571  18.134  13.420  1.00  22.79
ATOM   774  CA   THR  298  23.046  17.338  14.404  1.00  24.28
ATOM   775  CB   THR  298  24.334  17.649  15.813  1.00  24.90
ATOM   776  OG1  THR  298  25.767  17.588  15.822  1.00  26.90
ATOM   777  CG2  THR  298  23.780  16.639  15.800  1.00  24.31
ATOM   778  C    THR  298  22.320  17.460  14.321  1.00  21.89
ATOM   779  O    THR  298  21.614  16.458  14.441  1.00  21.95
ATOM   780  N    SER  299  21.817  18.665  14.074  1.00  20.47
ATOM   781  CA   SER  299  20.373  18.865  13.969  1.00  19.52
ATOM   782  CB   SER  299  20.025  20.352  14.038  1.00  16.97
ATOM   783  OG   SER  299  20.393  20.874  15.300  1.00  15.99
ATOM   784  C    SER  299  19.835  18.243  12.688  1.00  18.61
ATOM   785  O    SER  299  18.810  17.570  12.698  1.00  18.83
ATOM   786  N    ILE  300  20.557  18.418  11.597  1.00  18.65
ATOM   787  CA   ILE  300  20.155  17.863  10.322  1.00  19.88
ATOM   788  CB   ILE  300  21.085  18.343   9.190  1.00  20.48
ATOM   789  CG2  ILE  300  20.691  17.693   7.846  1.00  19.34
ATOM   790  CG1  ILE  300  21.022  19.867   9.098  1.00  19.44
ATOM   791  CD1  ILE  300  22.063  20.435   8.226  1.00  20.43
ATOM   792  C    ILE  300  20.135  16.337  10.358  1.00  20.31
ATOM   793  O    ILE  300  19.194  15.717   9.867  1.00  21.85
ATOM   794  N    LEU  301  21.161  15.733  10.950  1.00  20.00
ATOM   795  CA   LEU  301  21.243  14.280  11.022  1.00  19.61
ATOM   796  CB   LEU  301  22.526  13.852  11.516  1.00  20.91
ATOM   797  CG   LEU  301  22.985  12.379  11.326  1.00  23.01
ATOM   798  CD1  LEU  301  23.023  12.031   9.851  1.00  21.44
ATOM   799  CD2  LEU  301  24.336  12.111  11.964  1.00  24.75
ATOM   800  C    LEU  301  20.156  13.742  11.942  1.00  19.65
ATOM   801  O    LEU  301  19.517  12.729  11.642  1.00  19.47
ATOM   802  N    GLY  302  19.929  14.450  13.044  1.00  19.18
ATOM   803  CA   GLY  302  18.919  14.048  14.002  1.00  18.63
ATOM   804  C    GLY  302  17.522  14.119  13.425  1.00  19.58
ATOM   805  O    GLY  302  16.731  13.194  13.606  1.00  20.15
ATOM   806  N    ILE  303  17.220  15.206  12.719  1.00  19.27
ATOM   807  CA   ILE  303  15.906  15.383  12.113  1.00  19.24

FIGURE 1 (cont.) 

[The coordinate records for atoms 808 to 923, beginning at Ile303 CB, are not recoverable from these pages.] 

FIGURE 1 (cont.) 

ATOM   924  O    VAL  319   8.520  18.535  15.127  1.00  19.92
ATOM   925  N    LEU  320   7.739  19.555  13.226  1.00  20.09
ATOM   926  CA   LEU  320   8.946  20.323  12.934  1.00  18.99
ATOM   927  CB   LEU  320   9.276  20.236  11.442  1.00  18.78
ATOM   928  CG   LEU  320   9.265  18.835  10.803  1.00  20.01
ATOM   929  CD1  LEU  320   9.449  18.951   9.309  1.00  20.04
ATOM   930  CD2  LEU  320  10.342  17.940  11.400  1.00  20.17
ATOM   931  C    LEU  320   8.609  21.754  13.352  1.00  17.86
ATOM   932  O    LEU  320   7.887  22.459  12.655  1.00  17.59
ATOM   933  N    ALA  321   9.102  22.160  14.515  1.00  18.47
ATOM   934  CA   ALA  321   8.807  23.476  15.055  1.00  20.27
ATOM   935  CB   ALA  321   8.355  23.354  16.522  1.00  19.53
ATOM   936  C    ALA  321   9.926  24.499  14.935  1.00  22.46
ATOM   937  O    ALA  321  11.105  24.199  15.192  1.00  22.78
ATOM   938  N    THR  322   9.531  25.732  14.615  1.00  23.57
ATOM   939  CA   THR  322  10.463  26.839  14.463  1.00  24.42
ATOM   940  CB   THR  322  11.384  26.632  13.224  1.00  23.87
ATOM   941  OG1  THR  322  12.358  27.677  13.159  1.00  22.26
ATOM   942  CG2  THR  322  10.561  26.506  11.935  1.00  20.90
ATOM   943  C    THR  322   9.721  28.156  14.301  1.00  25.40
ATOM   944  O    THR  322   8.632  28.227  13.719  1.00  25.14
ATOM   945  N    ALA  323  10.317  29.220  14.841  1.00  25.81
ATOM   946  CA   ALA  323   9.768  30.559  14.729  1.00  25.79
ATOM   947  CB   ALA  323  10.219  31.412  15.902  1.00  25.93
ATOM   948  C    ALA  323  10.288  31.153  13.423  1.00  26.98
ATOM   949  O    ALA  323   9.758  32.153  12.932  1.00  28.83
ATOM   950  N    THR  324  11.325  30.531  12.852  1.00  25.90
ATOM   951  CA   THR  324  11.925  31.003  11.617  1.00  24.88
ATOM   952  CB   THR  324  13.319  31.611  11.875  1.00  24.01
ATOM   953  OG1  THR  324  14.106  30.689  12.541  1.00  25.50
ATOM   954  CG2  THR  324  13.201  32.908  12.648  1.00  23.67
ATOM   955  C    THR  324  12.048  29.943  10.508  1.00  24.40
ATOM   956  O    THR  324  13.123  29.369  10.293  1.00  24.97
ATOM   957  N    PRO  325  10.935  29.624   9.829  1.00  22.47
ATOM   958  CD   PRO  325   9.557  30.088  10.041  1.00  21.14
ATOM   959  CA   PRO  325  10.978  28.534   8.755  1.00  22.40
ATOM   960  CB   PRO  325   9.500  28.446   8.422  1.00  20.85
ATOM   961  CG   PRO  325   8.922  29.759   8.720  1.00  19.94
ATOM   962  C    PRO  325  11.755  29.190   7.559  1.00  23.10
ATOM   963  O    PRO  325  12.008  30.396   7.486  1.00  23.30
ATOM   964  N    PRO  326  12.159  28.319   5.509  1.00  23.99
ATOM   965  CD   PRO  326  12.034  26.849   6.583  1.00  23.51
ATOM   966  CA   PRO  326  12.908  28.793   5.437  1.00  23.51
ATOM   967  CB   PRO  326  12.949  27.553   4.544  1.00  22.50
ATOM   968  CG   PRO  326  13.058  26.455   5.545  1.00  22.79
ATOM   969  C    PRO  326  12.245  29.985   4.740  1.00  22.55
ATOM   970  O    PRO  326  11.019  30.087   4.696  1.00  23.41
ATOM   971  N    GLY  327  13.070  30.901   4.249  1.00  20.16
ATOM   972  CA   GLY  327  12.576  32.077   3.561  1.00  20.04
ATOM   973  C    GLY  327  12.224  33.207   4.501  1.00  20.56
ATOM   974  O    GLY  327  11.581  34.216   4.075  1.00  21.64
ATOM   975  N    SER  328  12.544  33.044   5.779  1.00  22.07
ATOM   976  CA   SER  328  12.245  34.056   6.783  1.00  21.35
ATOM   977  CB   SER  328  12.379  33.455   8.189  1.00  20.72
ATOM   978  OG   SER  328  11.325  32.565   8.454  1.00  20.21
ATOM   979  C    SER  328  13.088  35.322   6.689  1.00  22.21
ATOM   980  O    SER  328  14.185  35.333   6.115  1.00  21.32
ATOM   981  N    VAL  329  12.552  36.387   7.275  1.00  21.78



ATOM 982 CA VAL 329 13.207 37.678 7.325 1.00 21.67
ATOM 983 CB VAL 329 12.634 38.651 6.278 1.00 21.56
ATOM 984 CG1 VAL 329 13.039 38.203 4.367 1.00 21.42
ATOM 985 CG2 VAL 329 11.123 38.747 6.410 1.00 19.14
ATOM 986 C VAL 329 12.960 38.192 8.737 1.00 21.95
ATOM 987 O VAL 329 12.080 37.683 9.424 1.00 21.78
ATOM 988 N THR 330 13.786 39.132 9.187 1.00 23.09
ATOM 989 CA THR 330 13.681 39.708 10.523 1.00 24.47
ATOM 990 CB THR 330 15.073 40.131 11.047 1.00 24.34
ATOM 991 OG1 THR 330 15.958 39.009 10.972 1.00 25.56
ATOM 992 CG2 THR 330 14.997 40.577 12.490 1.00 23.96
ATOM 993 C THR 330 12.719 40.888 10.521 1.00 25.81
ATOM 994 O THR 330 13.062 42.014 10.134 1.00 25.19
ATOM 995 N VAL 331 11.507 40.600 10.972 1.00 27.46
ATOM 996 CA VAL 331 10.415 41.558 11.029 1.00 29.93
ATOM 997 CB VAL 331 9.065 40.789 10.871 1.00 30.69
ATOM 998 CG1 VAL 331 7.385 41.620 11.359 1.00 33.40
ATOM 999 CG2 VAL 331 8.871 40.366 9.408 1.00 28.41
ATOM 1000 C VAL 331 10.431 42.356 12.330 1.00 30.65
ATOM 1001 O VAL 331 10.942 41.880 13.347 1.00 30.44
ATOM 1002 N PRO 332 9.923 43.601 12.296 1.00 31.80
ATOM 1003 CD PRO 332 9.525 44.351 11.088 1.00 32.94
ATOM 1004 CA PRO 332 9.868 44.471 13.469 1.00 32.90
ATOM 1005 CB PRO 332 8.882 45.544 13.029 1.00 33.03
ATOM 1006 CG PRO 332 9.299 45.772 11.616 1.00 32.19
ATOM 1007 C PRO 332 9.403 43.751 14.725 1.00 33.35
ATOM 1008 O PRO 332 8.539 42.879 14.573 1.00 33.22
ATOM 1009 N HIS 333 10.042 44.092 15.840 1.00 34.93
ATOM 1010 CA HIS 333 9.747 43.526 17.149 1.00 35.65
ATOM 1011 CB HIS 333 11.040 43.034 17.801 1.00 35.75
ATOM 1012 CG HIS 333 10.864 42.519 19.196 1.00 35.91
ATOM 1013 CD2 HIS 333 11.036 41.281 19.717 1.00 37.50
ATOM 1014 ND1 HIS 333 10.494 43.327 20.250 1.00 35.51
ATOM 1015 CE1 HIS 333 10.448 42.610 21.359 1.00 35.57
ATOM 1016 NE2 HIS 333 10.774 41.366 21.063 1.00 37.41
ATOM 1017 C HIS 333 9.152 44.666 17.967 1.00 36.79
ATOM 1018 O HIS 333 9.615 45.804 17.878 1.00 37.38
ATOM 1019 N PRO 334 8.174 44.352 18.833 1.00 37.08
ATOM 1020 CD PRO 334 7.664 43.006 19.111 1.00 36.74
ATOM 1021 CA PRO 334 7.509 45.364 19.680 1.00 37.36
ATOM 1022 CB PRO 334 6.622 44.508 20.589 1.00 38.15
ATOM 1023 CG PRO 334 6.344 43.286 19.753 1.00 38.80
ATOM 1024 C PRO 334 8.458 46.225 20.521 1.00 37.12
ATOM 1025 O PRO 334 8.426 47.448 20.447 1.00 37.73
ATOM 1026 N ASN 335 9.283 45.564 21.330 1.00 36.19
ATOM 1027 CA ASN 335 10.231 46.226 22.228 1.00 35.31
ATOM 1028 CB ASN 335 10.616 45.294 23.385 1.00 36.70
ATOM 1029 CG ASN 335 9.433 44.782 24.146 1.00 38.17
ATOM 1030 OD1 ASN 335 8.393 45.429 24.192 1.00 41.02
ATOM 1031 ND2 ASN 335 9.579 43.602 24.751 1.00 37.75
ATOM 1032 C ASN 335 11.547 46.575 21.640 1.00 33.43
ATOM 1033 O ASN 335 12.410 47.100 22.404 1.00 35.07
ATOM 1034 N ILE 336 11.732 45.513 20.325 1.00 30.99
ATOM 1035 CA ILE 336 13.043 45.972 19.783 1.00 27.79
ATOM 1036 CB ILE 336 13.771 45.706 19.242 1.00 26.56
ATOM 1037 CG2 ILE 336 15.209 45.038 18.847 1.00 23.81
ATOM 1038 CG1 ILE 336 13.786 44.509 20.312 1.00 24.10
ATOM 1039 CD1 ILE 336 14.260 43.272 19.819 1.00 22.21



ATOM 1098 C THR 344 29.842 58.908 14.008 1.00 27.63
ATOM 1099 O THR 344 28.884 58.225 14.396 1.00 27.69
ATOM 1100 N GLY 345 31.003 58.969 14.649 1.00 26.95
ATOM 1101 CA GLY 345 31.211 58.260 15.890 1.00 26.75
ATOM 1102 C GLY 345 32.688 58.168 15.189 1.00 28.21
ATOM 1103 O GLY 345 33.512 58.276 15.285 1.00 28.29
ATOM 1104 N GLU 346 33.023 57.926 17.451 1.00 28.51
ATOM 1105 CA GLU 346 34.414 57.832 17.863 1.00 29.04
ATOM 1106 CB GLU 346 34.541 58.020 19.378 1.00 30.35
ATOM 1107 CG GLU 346 34.291 59.439 19.880 1.00 32.52
ATOM 1108 CD GLU 346 32.888 59.926 19.593 1.00 32.87
ATOM 1109 OE1 GLU 346 31.929 59.174 19.854 1.00 32.23
ATOM 1110 OE2 GLU 346 32.747 61.057 19.086 1.00 35.06
ATOM 1111 C GLU 346 35.034 56.523 17.479 1.00 28.20
ATOM 1112 O GLU 346 36.283 55.502 17.218 1.00 29.40
ATOM 1113 N ILE 347 34.321 55.431 17.457 1.00 28.52
ATOM 1114 CA ILE 347 34.889 54.110 17.153 1.00 26.91
ATOM 1115 CB ILE 347 34.453 53.035 13.192 1.00 27.90
ATOM 1116 CG2 ILE 347 35.250 51.776 18.005 1.00 28.31
ATOM 1117 CG1 ILE 347 34.645 53.537 19.620 1.00 29.70
ATOM 1118 CD1 ILE 347 33.400 54.194 20.186 1.00 32.81
ATOM 1119 C ILE 347 34.551 53.550 15.781 1.00 24.52
ATOM 1120 O ILE 347 33.392 53.258 15.490 1.00 23.00
ATOM 1121 N PRO 348 35.559 53.398 14.914 1.00 23.86
ATOM 1122 CD PRO 348 36.947 53.871 15.026 1.00 23.37
ATOM 1123 CA PRO 348 35.302 52.854 13.582 1.00 24.53
ATOM 1124 CB PRO 348 36.692 52.834 12.959 1.00 24.34
ATOM 1125 CG PRO 348 37.338 54.030 13.588 1.00 23.28
ATOM 1126 C PRO 348 34.759 51.449 13.786 1.00 25.57
ATOM 1127 O PRO 348 35.290 50.696 14.605 1.00 25.45
ATOM 1128 N PHE 349 33.677 51.118 13.084 1.00 25.89
ATOM 1129 CA PHE 349 33.041 49.812 13.216 1.00 24.50
ATOM 1130 CB PHE 349 31.971 49.860 14.318 1.00 23.29
ATOM 1131 CG PHE 349 31.358 48.522 14.632 1.00 25.07
ATOM 1132 CD1 PHE 349 32.025 47.510 15.447 1.00 26.07
ATOM 1133 CD2 PHE 349 30.111 48.175 14.119 1.00 25.08
ATOM 1134 CE1 PHE 349 31.461 46.372 15.750 1.00 25.64
ATOM 1135 CE2 PHE 349 29.535 46.945 14.411 1.00 25.25
ATOM 1136 CZ PHE 349 30.211 46.038 15.230 1.00 26.19
ATOM 1137 C PHE 349 32.437 49.321 11.904 1.00 23.33
ATOM 1138 O PHE 349 31.361 49.749 11.501 1.00 22.31
ATOM 1139 N TYR 350 33.154 48.418 11.247 1.00 23.43
ATOM 1140 CA TYR 350 32.725 47.820 9.986 1.00 23.78
ATOM 1141 CB TYR 350 31.636 46.759 10.230 1.00 21.13
ATOM 1142 CG TYR 350 32.133 45.531 10.967 1.00 20.42
ATOM 1143 CD1 TYR 350 32.302 45.552 12.344 1.00 19.57
ATOM 1144 CE1 TYR 350 32.794 44.440 13.028 1.00 20.40
ATOM 1145 CD2 TYR 350 32.465 44.353 10.279 1.00 21.51
ATOM 1146 CE2 TYR 350 32.963 43.224 10.958 1.00 19.67
ATOM 1147 CZ TYR 350 33.126 43.282 12.335 1.00 20.56
ATOM 1148 OH TYR 350 33.648 42.212 13.040 1.00 20.53
ATOM 1149 C TYR 350 32.280 48.823 8.920 1.00 25.07
ATOM 1150 O TYR 350 31.215 48.680 8.324 1.00 26.53
ATOM 1151 N GLY 351 33.118 49.817 8.656 1.00 25.52
ATOM 1152 CA GLY 351 32.789 50.802 7.644 1.00 26.67
ATOM 1153 C GLY 351 32.054 52.000 8.205 1.00 27.75
ATOM 1154 O GLY 351 32.097 53.088 7.624 1.00 29.56
ATOM 1155 N LYS 352 31.332 51.788 9.300 1.00 28.35
1199
1200
1201
1202
1204
1205
1206
CA LYS
CB LYS
CG LYS
CD LYS
CE LYS
NZ LYS
C LYS
O LYS
N ALA
CA ALA
CB ALA
C ALA
O ALA
N ILE
CA ILE
CB ILE
CG2 ILE
CG1 ILE
CD1 ILE
C ILE
O ILE
N PRO
CD PRO
CA PRO
CB PRO
CG PRO
C PRO
O PRO
N LEU
CA LEU
CB LEU
CG LEU
CD1 LEU
CD2 LEU
C LEU
O LEU
N ALA
CA ALA
CB ALA
C ALA
O ALA
N VAL
CA VAL
CB VAL
CG1 VAL
CG2 VAL
C VAL
O VAL
N ILE
360



30.596
29.182
28.352
28.138
27.757
26.677
31.373
32.560
30.704
31.353
31.987
30.332
29.122
30.816
29.944
30.136
29.224
29.864
30.151
30.259
31.446
29.231
27.876
29.379
28.042
27.507
29.521
28.665
30.602
30.821
32.024
33.384
34.515
33.457
29.591
29.272
28.887
27.685
27.072
26.638
26.011
26.502
25.506
25.119
24.714
25.266
25.881
25.050
27.121
27.567
28.582
27.857
30.892
28.159
28.437
28.304
28.879



52.858
52.405
51.994
53.145
51.594
53.197
52.903
53.311
54.186
55.568
54.172
53.977
53.949
52.664
52.705
51.409
50.093
55.147
55.480
55.890
55.817
57.069
57.794
57.257
55.825
56.004
59.522
54.892
56.119
53.179
52.892
51.982
51.199
52.814
52.025
53.308
52.459
54.616
55.171



9.962 
10.332 
9.145 
8.185 
6.811 
6.876 
11.213 
11.313 
12.170 
13.407 
13.257
14.533 
14.297 
15.750 
16.903 
17.758 
18.989 
16.922 
17.660
17.770 
17.980 
18.172 
17.501 
19.025 
18.851 
17.531 
20.463 
20.951 
21.133 
22.512 
23.110 
22.338 
23.275
23.568 
23.367 
24.257 
23.069 
23.813 
23.228 
23.873 
24.911 
22.791 
22.758 
21.298 
20.511 
20.597 
23.590
23.797
24.074
24.893 
24.136 
23.029 
23.500 
22.995 
26.237 
27.076 
25.449 
27.681 



1.00 26.46 
1.00 26.95 
1.00 28.09 
1.00 29.10 
1.00 31.84 
1.00 34.57 
1.00 25.15 
1.00 24.74 
1.00 26.04 
1.00 27.08 
1.00 26.55 
1.00 27.31 
1.00 29.07 
1.00 26.83 
1.00 26.78 
1.00 26.37 
1.00 23.34 
1.00 26.07 
1.00 26.73 
1.00 27.53 
1.00 27.83 
1.00 27.07 
1.00 26.53 
1.00 27.16 
1.00 27.57 
1.00 28.35 
1.00 27.80 
1.00 28.66 
1.00 27.31 
1.00 26.73 
1.00 27.84 
1.00 27.71 
1.00 27.97 
1.00 25.40 
1.00 25.92 
1.00 25.15 
1.00 26.25 
1.00 26.44 
1.00 25.83 
1.00 26.65 
1.00 26.75 
1.00 26.24 
1.00 26.40 
1.00 25.09 
1.00 23.26 
1.00 24.96 
1.00 26.35 
1.00 27.18 
1.00 25.51 
1.00 24.89 
1.00 22.23 
1.00 21.17 
1.00 21.55 
1.00 17.63 
1.00 25.79 
1.00 25.36 
1.00 29.45 
1.00 31.32 



ATOM 1214 CB ALA 360 28.880 56.691 27.621 1.00 30.01
ATOM 1215 C ALA 360 28.207 54.584 23.972 1.00 32.84
ATOM 1216 O ALA 360 28.889 54.359 29.950 1.00 33.70
ATOM 1217 N GLY 361 26.880 54.659 28.981 1.00 32.93
ATOM 1218 CA GLY 361 26.172 54.179 30.154 1.00 32.53
ATOM 1219 C GLY 361 25.285 53.020 29.739 1.00 32.16
ATOM 1220 O GLY 361 24.892 52.945 28.579 1.00 33.01
ATOM 1221 N GLY 362 24.978 52.115 30.564 1.00 31.44
ATOM 1222 CA GLY 362 24.124 50.980 30.352 1.00 28.17
ATOM 1223 C GLY 362 24.894 49.725 29.979 1.00 28.18
ATOM 1224 O GLY 362 26.128 49.699 30.060 1.00 28.64
ATOM 1225 N ARG 363 24.168 48.677 29.597 1.00 25.54
ATOM 1226 CA ARG 363 24.762 47.404 29.196 1.00 25.19
ATOM 1227 CB ARG 363 24.135 45.256 29.985 1.00 25.16
ATOM 1228 CG ARG 363 23.726 46.531 31.401 1.00 25.44
ATOM 1229 CD ARG 363 23.093 45.460 32.123 1.00 29.20
ATOM 1230 NE ARG 363 24.075 44.760 32.935 1.00 34.10
ATOM 1231 CZ ARG 363 24.655 43.608 32.616 1.00 35.13
ATOM 1232 NH1 ARG 363 24.358 42.977 31.494 1.00 34.77
ATOM 1233 NH2 ARG 363 25.613 43.130 33.389 1.00 36.31
ATOM 1234 C ARG 363 24.504 47.226 27.695 1.00 24.94
ATOM 1235 O ARG 363 23.377 47.393 27.222 1.00 24.76
ATOM 1236 N HIS 364 25.542 46.910 26.934 1.00 24.18
ATOM 1237 CA HIS 364 25.385 46.776 25.494 1.00 23.64
ATOM 1238 CB HIS 364 25.891 48.048 24.830 1.00 23.19
ATOM 1239 CG HIS 364 25.367 49.291 25.478 1.00 25.11
ATOM 1240 CD2 HIS 364 25.895 50.084 26.438 1.00 24.33
ATOM 1241 ND1 HIS 364 24.116 49.803 25.205 1.00 25.29
ATOM 1242 CE1 HIS 364 23.894 50.855 25.970 1.00 23.44
ATOM 1243 NE2 HIS 364 24.959 51.047 26.726 1.00 25.09
ATOM 1244 C HIS 364 26.133 45.559 24.993 1.00 23.60
ATOM 1245 O HIS 364 27.098 45.128 25.528 1.00 24.11
ATOM 1246 N LEU 365 25.645 44.976 23.900 1.00 22.16
ATOM 1247 CA LEU 365 25.247 43.784 23.310 1.00 21.25
ATOM 1248 CB LEU 365 25.241 42.535 23.316 1.00 20.31
ATOM 1249 CG LEU 365 25.667 41.278 22.749 1.00 19.06
ATOM 1250 CD1 LEU 365 26.952 40.821 23.413 1.00 18.55
ATOM 1251 CD2 LEU 365 24.562 40.252 22.966 1.00 16.61
ATOM 1252 C LEU 365 26.672 44.079 21.883 1.00 21.19
ATOM 1253 O LEU 365 25.836 44.377 21.031 1.00 21.03
ATOM 1254 N ILE 366 27.966 43.978 21.615 1.00 20.98
ATOM 1255 CA ILE 366 28.473 44.255 20.283 1.00 21.20
ATOM 1256 CB ILE 366 29.646 45.254 20.322 1.00 21.57
ATOM 1257 CG2 ILE 366 30.113 45.572 18.911 1.00 19.27
ATOM 1258 CG1 ILE 366 29.225 46.535 21.051 1.00 21.36
ATOM 1259 CD1 ILE 366 30.303 47.607 21.078 1.00 20.61
ATOM 1260 C ILE 366 28.925 42.974 19.605 1.00 21.06
ATOM 1261 O ILE 366 29.825 42.296 20.093 1.00 20.15
ATOM 1262 N PHE 367 28.274 42.635 18.495 1.00 20.98
ATOM 1263 CA PHE 367 28.610 41.438 17.747 1.00 21.40
ATOM 1264 CB PHE 367 27.354 40.804 17.125 1.00 20.48
ATOM 1265 CG PHE 367 26.556 39.975 18.093 1.00 20.75
ATOM 1266 CD1 PHE 367 25.393 40.487 18.671 1.00 20.80
ATOM 1267 CD2 PHE 367 26.951 38.575 18.416 1.00 19.09
ATOM 1268 CE1 PHE 367 24.633 39.711 19.557 1.00 19.99
ATOM 1269 CE2 PHE 367 26.210 37.399 19.291 1.00 19.22
ATOM 1270 CZ PHE 367 25.043 38.417 19.865 1.00 20.05
ATOM 1271 C PHE 367 29.644 41.705 16.670 1.00 22.44
407 30.548 45.178 26.659 1.00 17.42
407 31.753 45.060 25.806 1.00 18.03
408 29.773 44.155 25.351 1.00 17.57
408 30.340 42.868 26.025 1.00 18.23
408 29.371 41.733 26.449 1.00 19.02
408 29.885 40.373 26.012 1.00 18.33
408 29.178 41.757 27.947 1.00 17.66
408 30.572 42.850 24.509 1.00 18.78
408 29.686 43.231 23.742 1.00 18.45
409 31.800 42.568 24.093 1.00 17.88
409 32.137 42.483 22.679 1.00 17.88
409 33.418 43.249 22.359 1.00 17.72
409 33.718 43.135 20.884 1.00 19.24
409 33.254 44.719 22.749 1.00 17.08
409 32.308 40.978 22.435 1.00 18.24
409 33.257 40.360 22.909 1.00 16.84
410 31.392 40.397 21.668 1.00 18.77
410 31.400 38.958 21.473 1.00 18.14
410 30.431 38.351 22.485 1.00 18.34
410 31.054 38.426 20.089 1.00 17.88
410 30.686 39.176 19.181 1.00 18.45
411 31.194 37.109 19.950 1.00 16.99
411 30.827 36.404 18.738 1.00 17.24
411 31.821 35.299 18.361 1.00 16.51
411 31.853 34.331 19.409 1.00 18.59
411 33.204 35.852 18.101 1.00 18.00
411 29.504 35.711 19.103 1.00 19.33
411 28.972 35.889 20.204 1.00 20.25
412 29.016 34.851 18.221 1.00 21.45
412 27.768 34.145 18.473 1.00 22.15
412 27.239 33.567 17.165 1.00 19.65
412 26.783 34.635 16.221 1.00 15.73
412 26.071 35.534 16.684 1.00 16.40
412 27.137 34.591 15.031 1.00 16.44
412 27.868 33.064 19.549 1.00 24.32
412 26.888 32.380 19.848 1.00 23.10
413 29.055 32.920 20.125 1.00 28.20
413 29.284 31.937 21.168 1.00 31.95
413 30.724 31.974 21.611 1.00 31.34
413 28.371 32.172 22.350 1.00 35.40
413 27.812 31.222 22.905 1.00 38.88
414 28.207 33.434 22.755 1.00 38.48
414 27.368 33.772 23.911 1.00 40.49
414 27.411 35.292 24.203 1.00 40.58
414 26.880 35.779 25.570 1.00 39.61
414 27.734 35.225 26.691 1.00 39.02
414 25.855 37.290 25.652 1.00 38.26
414 25.923 33.298 23.721 1.00 41.04
414 25.585 32.245 24.309 1.00 41.21
418 23.962 34.889 28.719 1.00 49.92
418 24.665 35.935 29.554 1.00 49.95
418 25.107 35.639 30.844 1.00 48.64
418 24.847 37.228 29.063 1.00 49.25
418 25.716 36.613 31.632 1.00 48.52
418 25.457 38.212 29.846 1.00 49.33
418 25.890 37.905 31.131 1.00 48.87
418 21.732 35.511 29.710 1.00 48.72
418 21.041 36.066 28.842 1.00 48.37
ATOM 2177 CA LEU
ATOM 2178 CB LEU
ATOM 2179 CG LEU
ATOM 2180 CD1 LEU
ATOM 2181 CD2 LEU
ATOM 2182 C LEU
ATOM 2183 O LEU
ATOM 2184 N CYS
ATOM 2185 CA CYS
ATOM 2186 CB CYS
ATOM 2187 SG CYS
ATOM 2188 C CYS
ATOM 2189 O CYS
ATOM 2190 N GLU
ATOM 2191 CA GLU
ATOM 2192 CB GLU
ATOM 2193 CG GLU
ATOM 2194 CD GLU
ATOM 2195 OE1 GLU
ATOM 2196 OE2 GLU
ATOM 2197 C GLU
ATOM 2198 O GLU
ATOM 2199 N CYS



486 23.048 22.371 4.795 1.00 25.53
486 20.822 23.226 5.034 1.00 23.72
486 22.549 21.098 4.564 1.00 25.73
486 20.311 21.959 4.804 1.00 25.05
486 21.174 20.887 4.569 1.00 25.77
486 24.004 25.089 3.141 1.00 24.60
486 23.616 24.599 2.075 1.00 22.55
487 25.282 25.232 3.465 1.00 25.33
487 26.392 24.769 2.639 1.00 25.56
487 27.674 25.435 3.150 1.00 25.82
487 28.810 25.337 2.174 1.00 25.43
487 28.767 26.041 1.155 1.00 26.83
487 29.752 24.568 2.421 1.00 26.93
487 26.590 23.244 2.582 1.00 26.12
487 26.323 22.518 3.548 1.00 26.16
488 27.134 22.781 1.462 1.00 26.57
488 27.421 21.365 1.244 1.00 26.60
488 27.972 21.165 -0.173 1.00 28.61
488 28.173 19.795 -0.481 1.00 31.93
488 28.421 20.838 2.286 1.00 26.05
488 28.447 19.639 2.583 1.00 26.13
489 29.227 21.728 2.861 1.00 24.65
489 30.185 21.301 3.864 1.00 22.91
489 31.078 22.459 4.300 1.00 21.25
489 30.310 23.551 4.768 1.00 21.37
489 29.422 20.737 5.062 1.00 23.17
489 29.978 19.963 5.846 1.00 24.42
490 28.156 21.129 5.216 1.00 21.89
490 27.363 20.620 5.324 1.00 21.05
490 26.038 21.396 6.522 1.00 21.77
490 25.220 20.725 7.598 1.00 20.51
490 26.314 22.853 5.931 1.00 20.48
490 27.088 19.147 6.051 1.00 20.91
490 27.114 18.341 5.973 1.00 22.47
491 26.879 18.793 4.779 1.00 21.01
491 26.636 17.397 4.390 1.00 21.10
491 26.283 17.289 2.909 1.00 20.22
491 25.006 18.026 2.505 1.00 22.49
491 24.743 17.853 1.014 1.00 20.45
491 23.829 17.495 3.329 1.00 21.46
491 27.900 16.594 4.686 1.00 20.73
491 27.843 15.522 5.292 1.00 22.35
492 29.046 17.138 4.291 1.00 19.79
492 30.317 16.491 4.556 1.00 19.65
492 31.468 17.390 4.092 1.00 18.82
492 33.106 16.704 4.401 1.00 20.38
492 30.413 16.274 6.067 1.00 19.87
492 30.801 15.205 5.537 1.00 20.08
493 29.997 17.289 6.817 1.00 21.33
493 30.028 17.262 8.274 1.00 22.71
493 29.724 18.543 8.830 1.00 24.31
493 30.165 18.820 10.259 1.00 28.02
493 30.042 20.247 10.763 1.00 29.49
493 29.269 21.030 10.165 1.00 31.62
493 30.720 20.585 11.758 1.00 32.33
493 29.074 15.229 3.862 1.00 21.21
493 29.344 15.663 9.925 1.00 21.52
494 27.975 15.969 8.161 1.00 20.48
501 33.319 13.706
503 28.585 4.337
507 25.079 2.968
DNA COORDINATES

Atom Type Resid # X Y Z Occ B

3226 O3' THY 2 37.257 37.479 7.626 1.00 62.88
3227 P THY 3 38.537 37.587 8.598 1.00 54.59
3228 O1P THY 3 39.744 37.500 7.737 1.00 65.36
3229 O2P THY 3 38.349 38.772 9.478 1.00 54.52
3230 O5' THY 3 38.455 35.284 9.516 1.00 61.39
3231 C2' THY 3 39.582 33.841 11.157 1.00 57.22
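
For readers who wish to work with these listings programmatically, the following is a minimal sketch, not part of the original disclosure, of how one coordinate row might be parsed and an interatomic distance computed. It assumes each row holds nine whitespace-separated fields (atom number, atom type, residue name, residue number, X, Y, Z, occupancy, B-factor), optionally preceded by the word ATOM as in the protein listing. The names AtomRecord, parse_row and distance are hypothetical, and the atom labels O3' and P in the usage note assume the standard reading of the printed labels.

```python
import math
from typing import NamedTuple


class AtomRecord(NamedTuple):
    """One row of the coordinate listing (field names are illustrative)."""
    serial: int
    atom_type: str
    res_name: str
    res_num: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_row(line: str) -> AtomRecord:
    """Parse a whitespace-separated coordinate row into an AtomRecord."""
    fields = line.split()
    # Protein rows start with the literal word ATOM; DNA rows do not.
    if fields and fields[0] == "ATOM":
        fields = fields[1:]
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}: {line!r}")
    serial, atom_type, res_name, res_num, x, y, z, occ, b = fields
    return AtomRecord(int(serial), atom_type, res_name, int(res_num),
                      float(x), float(y), float(z), float(occ), float(b))


def distance(a: AtomRecord, b: AtomRecord) -> float:
    """Euclidean distance between two atoms, in the units of the listing."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))
```

Applied to atoms 3226 (O3' of THY 2) and 3227 (P of THY 3), distance returns about 1.61, which is a plausible length in Angstroms for the bond linking consecutive nucleotides. A check of this kind may help confirm that a row has been transcribed correctly.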
be material to patentability as defined in Title 37, Code of
Federal Regulations, § 1.56.

I hereby claim foreign priority benefits under Title 35,
United States Code, § 119 of any foreign application(s) for
patent or inventor's certificate listed below and have also
identified below any foreign application for patent or
inventor's certificate having a filing date before that of
the application on which priority is claimed:



Prior Foreign Application(s)                        Priority Claimed

(Number)    (Country)    (Day/Month/Year Filed)     [ ] Yes    [ ] No



I hereby claim under Title 35, United States Code, § 119(e)
of any United States provisional application(s) listed
below:

60/055,772            August 13, 1997    abandoned
(Appln Serial No.)    (Filing Date)      (Status) (patented, pending, abandoned)



I hereby claim the benefit under Title 35, United States
Code, § 120 of any United States application(s), or § 365(c)
of any PCT International application designating the United
States, listed below and, insofar as the subject matter of
each of the claims of this application is not disclosed in
the prior United States application in the manner provided
by the first paragraph of Title 35, United States Code,
§ 112, I acknowledge the duty to disclose to the United
States Patent and Trademark Office all information known by
me to be material to patentability as defined in Title 37,
Code of Federal Regulations, § 1.56 which became available
between the filing date of the prior application and the
national or PCT international filing date of this
application:

PCT/US98/16879        August 13, 1998    Pending
(Appln Serial No.)    (Filing Date)      (Status) (patented, pending, abandoned)

As a named inventor, I hereby appoint the following
attorneys or agents to prosecute this application and
transact all business in the United States Patent and
Trademark Office connected therewith:

James F. Haley, Jr. (Reg. No. 27,794)
Andrew S. Marks, Esq. (Reg. No. 33,259)
Barbara A. Ruskin (Reg. No. 39,350)
Lisa A. Dixon (Reg. No. 40,995)





Send correspondence to: James F. Haley, Jr., Esq. 

FISH & NEAVE 

1251 Avenue of the Americas 
New York, New York 10020-1104 

Direct telephone calls to: James F. Haley, Jr., Esq. 

(212) 596-9000 

I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on 
information and belief are believed to be true; and further 
that these statements were made with the knowledge that 
willful false statements and the like so made are punishable 
by fine or imprisonment, or both, under Section 1001 of 
Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Full name of first inventor Joseph L. Kim 



First Inventor's signature 




Residence 20 Ridge Avenue, Natick, Massachusetts

Citizenship U.S. 

Post Office Address 20 Ridge Avenue, Natick, Massachusetts 
01760 



Full name of second joint inventor Kurt Morgenstern

Second Inventor's signature

Date

Residence 17B Bonnie Lane, Derry, New Hampshire

Citizenship U.S.

Post Office Address 17B Bonnie Lane, Derry, New Hampshire
03038



Full name of fourth inventor Paul Caron

Fourth Inventor's signature

Date

Residence 85 Maple Street, Malden, Massachusetts

Citizenship U.S.

Post Office Address 85 Maple Street, Malden, Massachusetts



Full name of third joint inventor Chao Lin

Third Inventor's signature

Date

Residence 42 Stanton Road #2, Brookline, Massachusetts

Citizenship U.S.

Post Office Address 42 Stanton Road #2, Brookline,
Massachusetts 02445



VPI97-101 CIP CON 



DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as 
stated below next to my name; 

I believe I am the original, first and sole inventor (if 
only one name is listed below) or an original, first and 
joint inventor (if plural names are listed below) of the 
subject matter which is claimed and for which a patent is 
sought on the invention entitled: 

CRYSTALS OF HEPATITIS C VIRUS HELICASE OR FRAGMENTS THEREOF 
COMPRISING A HELICASE BINDING POCKET 



the specification of which

(check one)
[x] is attached hereto
[ ] was filed on ________ as Application Serial No. ________
    and was amended on ________ (if applicable).

I hereby state that I have reviewed and understand the 
contents of the above-identified specification, including 
the claims, as amended by any amendment referred to above. 

I do not know and do not believe that the invention was ever 
patented or described in any printed publication in any 
country before my or our invention thereof or more than one 
year prior to this application. 

I do not know and do not believe that the invention was in 
public use or on sale in the United States of America more 
than one year prior to this application. 

I acknowledge the duty to disclose to the United States 
Patent and Trademark Office all information known by me to 
be material to patentability as defined in Title 37, Code of 
Federal Regulations, § 1.56. 

I hereby claim foreign priority benefits under Title 35,
United States Code, § 119 of any foreign application(s) for
patent or inventor's certificate listed below and have also
identified below any foreign application for patent or
inventor's certificate having a filing date before that of
the application on which priority is claimed:

Prior Foreign Application(s)                        Priority Claimed

(Number)    (Country)    (Day/Month/Year Filed)     [ ] Yes    [ ] No



I hereby claim under Title 35, United States Code, § 119(e)
of any United States provisional application(s) listed
below:

60/055,772            August 13, 1997    abandoned
(Appln Serial No.)    (Filing Date)      (Status) (patented, pending, abandoned)



I hereby claim the benefit under Title 35, United States
Code, § 120 of any United States application(s), or § 365(c)
of any PCT International application designating the United
States, listed below and, insofar as the subject matter of
each of the claims of this application is not disclosed in
the prior United States application in the manner provided
by the first paragraph of Title 35, United States Code,
§ 112, I acknowledge the duty to disclose to the United
States Patent and Trademark Office all information known by
me to be material to patentability as defined in Title 37,
Code of Federal Regulations, § 1.56 which became available
between the filing date of the prior application and the
national or PCT international filing date of this
application:

PCT/US98/16879        August 13, 1998    Pending
(Appln Serial No.)    (Filing Date)      (Status) (patented, pending, abandoned)

As a named inventor, I hereby appoint the following
attorneys or agents to prosecute this application and
transact all business in the United States Patent and
Trademark Office connected therewith:

James F. Haley, Jr. (Reg. No. 27,794)
Andrew S. Marks, Esq. (Reg. No. 33,259)
Barbara A. Ruskin (Reg. No. 39,350)
Lisa A. Dixon (Reg. No. 40,995)



Send correspondence to: James F. Haley, Jr., Esq.
FISH & NEAVE
1251 Avenue of the Americas
New York, New York 10020-1104

Direct telephone calls to: James F. Haley, Jr., Esq.
(212) 596-9000



I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on 
information and belief are believed to be true; and further 
that these statements were made with the knowledge that 
willful false statements and the like so made are punishable 
by fine or imprisonment, or both, under Section 1001 of 
Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Full name of first inventor Joseph L. Kim 

First Inventor's signature 

Date 

Residence 20 Ridge Avenue, Natick, Massachusetts 

Citizenship U.S. 

Post Office Address 20 Ridge Avenue, Natick, Massachusetts
01760 

Full name of second joint inventor Kurt Morgenstern

Second Inventor's signature

Date

Residence 17B Bonnie Lane, Derry, New Hampshire

Citizenship U.S.

Post Office Address 17B Bonnie Lane, Derry, New Hampshire



Full name of fourth inventor Paul Caron 

Fourth Inventor's signature 

Date 

Residence 85 Maple Street, Malden, Massachusetts

Citizenship U.S.

Post Office Address 85 Maple Street, Malden, Massachusetts



Full name of third joint inventor Chao Lin 

Third Inventor's signature 

Date 

Residence 42 Stanton Road #2, Brookline, Massachusetts

Citizenship U.S.

Post Office Address 42 Stanton Road #2, Brookline,
Massachusetts 02445



